Abstract

Density expansions for hypoelliptic diffusions $(X^1, \dots, X^d)$ are revisited. In particular, we
are interested in density expansions of the projection $(X^1_T, \dots, X^l_T)$, at time $T > 0$, with $l \le d$.
Global conditions are found which replace the well-known "not-in-cutlocus" condition known
from heat-kernel asymptotics; cf. G. Ben Arous [8]. Our small noise expansion allows for a
"second order" exponential factor. Applications include tail and implied volatility asymptotics
in some correlated stochastic volatility models; in particular, we solve a problem left open by
A. Gulisashvili and E.M. Stein (2009).

Keywords: Laplace method on Wiener space, generalized density expansions in small noise 
and small time, sub-Riemannian geometry with drift, focal points, stochastic volatility, implied 
volatility, large strike and small time asymptotics for implied volatility 

1 Introduction 

Given a multi-dimensional hypoelliptic diffusion process $X_t = (X^1_t, \dots, X^d_t : t \ge 0)$, started at
$X_0 = x_0$, we are interested in the behaviour of the probability density function $f = f(y,t)$ of
the projected (in general non-Markovian) process

$Y_t := \Pi_l \circ X_t := (X^1_t, \dots, X^l_t)$, $1 \le l \le d$.

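Both short time asymptotics and tail asymptotics, in presence of some scaling, can be derived from
the small noise problem

$dX^\varepsilon_t = b(\varepsilon, X^\varepsilon_t)\,dt + \varepsilon \sigma(X^\varepsilon_t)\,dW_t$, with $X^\varepsilon_0 = x^\varepsilon_0 \in \mathbb{R}^d$.

Our main technical result, based on the Laplace method on Wiener space following Ben Arous
[8, 9], is a density expansion for $Y^\varepsilon_T := \Pi_l \circ X^\varepsilon_T$ of the form, for $x_0, y, T$ fixed,

$f^\varepsilon(y,T) = e^{-c_1/\varepsilon^2}\, e^{c_2/\varepsilon}\, \varepsilon^{-l}\, (c_0 + O(\varepsilon))$ as $\varepsilon \downarrow 0$. (1)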

Leaving definitions and precise statements to the main text below (cf. theorem 9), let us briefly
mention our key assumptions:

(i) a strong Hörmander condition at all points (or in fact, a weak Hörmander condition at $x_0$ and
an explicit controllability condition);

(ii) existence of at most finitely many minimizers in the control problem which governs the leading
order behaviour;

(iii) invertibility of the deterministic Malliavin covariance matrix along the minimizers;

(iv) a global condition on $x_0 \in \mathbb{R}^d$, $y \in \mathbb{R}^l$ which we call non-focality, motivated by geometric
terminology [151 [T^ .

Conditions (i)-(iii) will not surprise the reader familiar with the works [3 IH [TTJ |49] . However,
condition (iv), which guarantees non-degeneracy of the minimizers (cf. proposition 7), appears to
be new in the context of density expansions, to the best of our knowledge, even in the Riemannian
case. It forms the essence of what is needed to extend the well-known point-point concept of
non-conjugacy (a crucial part of the "$\notin$ cut-locus" condition familiar from heat kernel expansions) to a
(sub-Riemannian, with drift) point-subspace setting. A simple example where (iv) and (1) fail is
given in section 4.2. We emphasize that our applications require us to introduce and characterize
non-focality in a control-theoretic generality.

As far as the expansion (1) is concerned, we draw attention to the second order exponential
factor present when $c_2 \ne 0$, which is somewhat unusual in the context of density expansions. As was
understood in the context of the general Laplace method on Wiener space [8, 9], this has to do
with allowing the drift vector field $b$ (and in the present paper also the starting point) to depend on $\varepsilon$ in
first order; the special case that arises from considering short time asymptotics - the small noise
parameter $\varepsilon$ is then introduced by Brownian scaling - always leads to $c_2 = 0$. It is interesting to
note that the work of Kusuoka-Stroock [5S] concerning precise asymptotics for Wiener functionals (in
the small noise limit), see also [35, 36] for recent applications to projected diffusions, was set up as an
expansion in $\varepsilon^2$. This is enough to cover the model case of short time expansions, but cannot yield
an expansion of the type (1) with $c_2 \ne 0$. A similar remark applies to the small noise expansions
for projected diffusions due to Takanobu-Watanabe [49].

One of our main motivations comes from recent density expansions by Gulisashvili-Stein. In [27,
Theorem 2.1] they prove that the stock-price in the uncorrelated Stein-Stein stochastic volatility
model admits a density with expansion

$B_1 s^{-B_2} e^{B_3 \sqrt{\log s}} (\log s)^{-1/2} \left(1 + O\left((\log s)^{-1/2}\right)\right)$ as $s \uparrow \infty$

with explicitly computable constants; asymptotic formulae of the implied volatility in the large
(similar: small) strike regime are then obtained as corollaries. When writing this expansion in
terms of log-price $Y = \log S$, it indeed has the form (1) with $y \equiv \log s = 1/\varepsilon^2$. More generally,

we can show from rather general and robust principles that the tail behaviour of $Y_T \in \mathbb{R}$ for fixed
$T > 0$, subject to a certain scaling with parameter $\theta \in \{1, 2\}$ in the full Markovian specification of
the model, has the form

$f(y,T) = e^{-c_1 y^{2/\theta}}\, e^{c_2 y^{1/\theta}}\, y^{1/\theta - 1} \left(c_0 + O(1/y^{1/\theta})\right)$ as $y \uparrow \infty$. (2)

It is worth mentioning that such an expansion leads immediately to call price and then (Black-
Scholes) implied volatility expansions in the large strike regime, cf. [271 [2S]; in the case $\theta = 2$

^More precisely, $x_0 \in \mathbb{R}^d$ must not be focal for the submanifold $N_y := (y, \cdot) \subset \mathbb{R}^d$. The classical example here is
of course $(0, 0) \in \mathbb{R}^2$, which is focal for the unit circle in $\mathbb{R}^2$.

^Strictly speaking, their O-term is $\log s$ with power $-1/4$; the authors have informed us, however, that a closer
look at their argument indeed gives power $-1/2$.
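typical for stochastic volatility,

$\sigma_{BS}(k,T)^2\, T = \left(\beta_1 k^{1/2} + \beta_2 + o(1)\right)^2$ as log-strike $k \uparrow \infty$;

$\beta_1 = \sqrt{2}\left(\sqrt{c_1} - \sqrt{c_1 - 1}\right)$, $\beta_2 = \frac{c_2}{\sqrt{2}}\left(1/\sqrt{c_1 - 1} - 1/\sqrt{c_1}\right)$.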

(Small strike asymptotics are similar and will not be discussed here.) The leading order behaviour
described by $\beta_1 = \beta_1(c_1)$ is well understood [37, 7]; the second order behaviour is given by $\beta_2 =
\beta_2(c_1, c_2)$. Further terms in this expansion are in principle possible [29]; in particular, the next
term would involve $c_0$. When applied to the Stein-Stein stochastic volatility model, the afore-
mentioned scaling indeed leads to a small-noise, hypoelliptic diffusion problem with non-vanishing
second order exponential factor, as is handled by our main theorem. We then solve a problem left
open in the afore-mentioned work [27, Theorem 2.1] in that we are able to compute the expansion
in the correlated case. The importance of allowing for correlation in stochastic volatility models
is well-documented, e.g. [20, 38], and evidence from estimation of parametric stochastic volatility
models suggests correlation parameter $\rho \approx -0.7$ or $\rho \approx -0.8$ for S&P 500, for instance; a finding
fairly robust across models and time periods [1]. With this in mind, we shall focus on the case
$-1 < \rho \le 0$ in our explicit analysis and derive explicit expressions for $c_1, c_2$. (In principle, the
Laplace method on which we rely yields an explicit expression for $c_0$, cf. [9, Thm 4, p 135], [35].)
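
To make the role of $c_1$ and $c_2$ concrete, the following minimal sketch evaluates the two leading coefficients of the large-strike expansion. It assumes the standard relations $\beta_1 = \sqrt{2}(\sqrt{c_1} - \sqrt{c_1 - 1})$ and $\beta_2 = (c_2/\sqrt{2})(1/\sqrt{c_1 - 1} - 1/\sqrt{c_1})$, valid for $c_1 > 1$; the function names and the numerical inputs are hypothetical and not taken from any calibrated model.

```python
import math

def implied_vol_coefficients(c1, c2):
    """Return (beta1, beta2) from tail constants c1 > 1 and c2 (assumed relations)."""
    if c1 <= 1.0:
        raise ValueError("c1 must exceed 1 for the formulas used here")
    beta1 = math.sqrt(2.0) * (math.sqrt(c1) - math.sqrt(c1 - 1.0))
    beta2 = (c2 / math.sqrt(2.0)) * (1.0 / math.sqrt(c1 - 1.0) - 1.0 / math.sqrt(c1))
    return beta1, beta2

def total_implied_variance(k, c1, c2):
    """Two-term approximation of sigma_BS(k, T)^2 * T for large log-strike k > 0."""
    beta1, beta2 = implied_vol_coefficients(c1, c2)
    return (beta1 * math.sqrt(k) + beta2) ** 2

# Hypothetical inputs, for illustration only.
beta1, beta2 = implied_vol_coefficients(c1=2.0, c2=0.5)
```

Note that $\beta_2$ vanishes when $c_2 = 0$, so in this approximation the second order exponential factor of the density is exactly what produces the constant correction to the leading $k^{1/2}$ term.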

Density expansions of diffusions in the small noise regime seem to go back (at least) to [34];
density expansions for projected diffusions in the small noise regime (which include the short time
regime), with applications to implied volatility expansions, were recently considered by Y. Osajima
[42], based on work with S. Kusuoka [35]. We partially improve on these results. First, as was
already mentioned, $c_2 = 0$ in these works, which makes any expansion of the form (2) out of reach.
Additionally, in comparison with [42] we do not assume $x_0$ near $(y, \cdot)$, nor ellipticity of the problem.
In further contrast to (the general results in) [35, 36] we provide a checkable, finite-dimensional
criterion that guarantees that the crucial infinite-dimensional non-degeneracy assumption, left as
such in [35, 36], is actually satisfied. On the other hand, these authors give explicit formulae for $c_0$
which we (presently) do not.

Finally, our expansion (1) leads to short time expansions for projected diffusion densities, under
global conditions on $(x_0, y)$, of the form

$f(y,t) \sim c_0(x_0, y)\, t^{-l/2} \exp\left(-\frac{d^2(x_0, y)}{2t}\right)$ as $t \downarrow 0$. (3)

When $l = d$, such expansions go back to classical works ranging from Molchanov [ID] to Ben Arous
[8]. The leading order behaviour $2t \log f(y,t) \sim -d^2(x_0, y)$ is due to Varadhan [50]. The case $l < d$,
in particular our global condition on $(x_0, y)$, appears to be new. That said, expansions of this form
have appeared in [49], [30], [42]; the last two references aimed at implied volatility expansions. In the
context of time-homogeneous local volatility models ($l = d = 1$), the expansion (3) holds trivially
without any conditions on $(x_0, y)$; the resulting expansion was derived (with explicit constant $c_0$)
in [22]. Subject to mild technical conditions on the diffusion coefficient, they show how to deduce
first a call price and then an implied volatility expansion in the short time (to maturity) regime:

$\sigma_{BS}(k,t) = k/d(x_0, k) + c(x_0, k)\, t + O(t^2)$ as $t \downarrow 0$;
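
As a sanity check of the shape of (3), consider the simplest case, assumed here purely for illustration: $X$ a standard $d$-dimensional Brownian motion started at $x_0$. The projection onto the first $l$ coordinates then has an explicit Gaussian density with $d(x_0, y) = |y - \Pi_l x_0|$ and $c_0 = (2\pi)^{-l/2}$. The sketch below (function names are hypothetical) works with the log-density to avoid underflow and lets one observe numerically that $2t \log f(y,t)$ approaches $-d^2(x_0, y)$.

```python
import math

def log_density_projected_bm(y, x0, t):
    """Log-density at y of the first len(y) coordinates of Brownian motion started at x0."""
    l = len(y)
    # zip truncates x0 to its first l coordinates, i.e. applies the projection
    dist2 = sum((yi - xi) ** 2 for yi, xi in zip(y, x0))
    return -0.5 * l * math.log(2.0 * math.pi * t) - dist2 / (2.0 * t)

def varadhan_ratio(y, x0, t):
    """Return 2 t log f(y, t), which should tend to -d^2(x0, y) as t -> 0."""
    return 2.0 * t * log_density_projected_bm(y, x0, t)

x0 = (0.0, 0.0, 0.0)   # starting point in R^3, so d = 3
y = (1.0, 2.0)         # target for the first l = 2 coordinates
for t in (1e-1, 1e-3, 1e-6):
    print(t, varadhan_ratio(y, x0, t))
```

In this example the deviation from $-d^2(x_0, y)$ is exactly $-l\, t \log(2\pi t)$, which is the contribution of the algebraic prefactor $t^{-l/2}$.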

•^In fact, the leading order behaviour of the density was discussed with large deviations methods in [16, p40, p265].
where $d(x_0, k)$ is a point-point distance and $c(x_0, k)$ is explicitly given. The celebrated Berestycki-
Busca-Florent (BBF) formula [13] asserts that $\sigma_{BS}(k,t) \sim k/d(x_0, k)$ as $t \downarrow 0$ is in fact valid in
generic stochastic volatility models; $d(x_0, k)$ is then understood as point-hyperplane distance. In
fact, $k/d(x_0, k)$ arose as initial condition of a non-linear evolution equation for the entire implied
volatility surface. As briefly indicated in [13, Sec 6.3] this can be used for a Taylor expansion
of $\sigma_{BS}(k,t)$ in $t$. Such expansions have also been discussed, based on heat kernel expansions on
Riemannian manifolds, by |15[ 1311 143], not always in full mathematical rigor. Some mathematical
results are given in [32], assuming ellipticity and close-to-the-moneyness $|k| \ll 1$; see also forth-
coming work by Ben Arous-Laurence [10]. We suspect that our formula (3), potentially applicable
far-from-the-money, will prove useful in this context and shall return to this in future work.

It should be noted that the BBF formula alone can be obtained from soft large deviation
arguments, cf. O Sec. 3.2.1] and the references therein. In a similar spirit, the Varadhan-type
formula $2t \log f(y,t) \sim -d^2(x_0, y)$, when $l < d$, can be shown without any conditions on $(x_0, y)$ by
large deviation methods, only relying on the existence of a reasonable density, cf. [51, Sec 5, Rmk
2.9].

As a final note, we recall that the (in general, non-Markovian) $\mathbb{R}^l$-valued Itô process $(Y_t : t \ge 0)$
admits - subject to some technical assumptions [28, 45] - a Markovian (or Gyöngy) projection. That
is, a time-inhomogeneous Markov diffusion $(\tilde{Y}_t : t \ge 0)$ with matching time-marginals, i.e. $\tilde{Y}_t = Y_t$
(in law) for every fixed $t \ge 0$. In a financial context, when $l = 1$, this process is known as (Dupire)
local volatility model and various authors [131 dH 1311 dD] have used this as an important intermediate
step in computing implied volatility in stochastic volatility models. Since all our expansions (small
noise, tail, short time) are relative to such time-marginals they may also be viewed as expansions
for the corresponding Markovian projections.

Acknowledgement: JDD, PKF and AJ would like to thank MATHEON for financial support. 
PKF would like to thank G. Ben Arous for pointing out conceptual similarities in [231|5] and several 
discussions thereafter. It is also a pleasure to thank F. Baudoin, J. P. Gauthier, A. Gulisashvili and 
P. Laurence for their interest and feedback. 

2 The main result and its corollaries 

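Consider a $d$-dimensional diffusion $(X^\varepsilon_t)_{t \ge 0}$ given by the stochastic differential equation

$dX^\varepsilon_t = b(\varepsilon, X^\varepsilon_t)\,dt + \varepsilon \sigma(X^\varepsilon_t)\,dW_t$, with $X^\varepsilon_0 = x^\varepsilon_0 \in \mathbb{R}^d$ (4)

and where $W = (W^1, \dots, W^m)$ is an $m$-dimensional Brownian motion. Unless otherwise stated,
we assume $b : [0,1) \times \mathbb{R}^d \to \mathbb{R}^d$, $\sigma = (\sigma_1, \dots, \sigma_m) : \mathbb{R}^d \to \mathrm{Lin}(\mathbb{R}^m \to \mathbb{R}^d)$ and $x^{\cdot}_0 : [0,1) \to \mathbb{R}^d$ to be
smooth, bounded with bounded derivatives of all orders. Set $\sigma_0 = b(0, \cdot)$ and assume that, for
every multiindex $\alpha$, the drift vector fields $b(\varepsilon, \cdot)$ converge to $\sigma_0$ in the sense

$\partial^\alpha_x b(\varepsilon, \cdot) \to \partial^\alpha_x b(0, \cdot) = \partial^\alpha_x \sigma_0(\cdot)$ uniformly on compacts as $\varepsilon \downarrow 0$. (5)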

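We shall also assume that

$\partial_\varepsilon b(\varepsilon, \cdot) \to \partial_\varepsilon b(0, \cdot)$ uniformly on compacts as $\varepsilon \downarrow 0$ (6)

*If (4) is understood in Stratonovich sense, so that $dW$ is replaced by $\circ dW$, the drift vector field $b(\varepsilon, \cdot)$ is changed
to $\tilde{b}(\varepsilon, \cdot) = b(\varepsilon, \cdot) - (\varepsilon^2/2) \sum_{i=1}^m \sigma_i \cdot \partial \sigma_i$. In particular, $\sigma_0$ is also the limit of $\tilde{b}(\varepsilon, \cdot)$ in the sense of (5).
and

$x^\varepsilon_0 = x_0 + \varepsilon \hat{x}_0 + o(\varepsilon)$ as $\varepsilon \downarrow 0$. (7)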

Later applications to stochastic volatility models aside, the (technical) main result of this paper
is a density expansion in $\varepsilon$ of the $\mathbb{R}^l$-valued (in general, non-Markovian) projection

$Y^\varepsilon_T := \Pi_l \circ X^\varepsilon_T = (X^{\varepsilon,1}_T, \dots, X^{\varepsilon,l}_T) \in \mathbb{R}^l$;

where $\Pi_l$ denotes the projection $(x^1, \dots, x^d) \mapsto (x^1, \dots, x^l)$, for fixed $l \in \{1, \dots, d\}$ and $T > 0$. Of
course, we need to guarantee that $Y^\varepsilon_T$ indeed admits a density. To this end, we make the standing
assumption that the weak Hörmander condition holds at $x_0$,

$\mathrm{span}\left[\sigma_i : 1 \le i \le m;\ [\sigma_j, \sigma_k] : 0 \le j, k \le m;\ \dots\right]\big|_{x_0} = T_{x_0}\mathbb{R}^d$; (H)

that is, the linear span of $\sigma_1, \dots, \sigma_m$ and all Lie brackets of $\sigma_0, \sigma_1, \dots, \sigma_m$ is full. Since this condition
is "open" it also holds, thanks to (5), for $\varepsilon > 0$ small enough, with $\sigma_0$ and $x_0$ replaced by $b(\varepsilon, \cdot)$ (or
$\tilde{b}(\varepsilon, \cdot)$, cf. previous footnote) and $x^\varepsilon_0$, respectively. It then is a classical result (due to Hörmander,
Malliavin) that the $\mathbb{R}^d$-valued r.v. $X^\varepsilon_T$ admits a (smooth) density for all times $T > 0$ and so does
its $\mathbb{R}^l$-valued projection $Y^\varepsilon_T$. We denote the probability density of $Y^\varepsilon_T$ by

$f^\varepsilon(\cdot, T) = f^\varepsilon(y, T)$ with $y \in \mathbb{R}^l$.

In theorem 9 below, it will be assumed that $\mathcal{K}_y$ is non-empty, where for fixed $a \in \mathbb{R}^l$ we define

$\mathcal{K}_{x_0,T;a} \equiv \mathcal{K}_a := \{h \in H : \Pi_l \circ \phi_T(h) = a\}$. (8)

Here, $H$ denotes the Cameron-Martin space, i.e. absolutely continuous paths with derivative in
$L^2([0,T], \mathbb{R}^m)$, and $\phi_T(h)$ denotes the time-$T$ solution to the controlled ordinary differential equa-
tion

$d\phi^h_t = \sigma_0(\phi^h_t)\,dt + \sum_{i=1}^m \sigma_i(\phi^h_t)\,dh^i_t$, $\phi^h_0 = x_0 \in \mathbb{R}^d$. (9)

Let us also define $\phi_{T \leftarrow t}(h) := \phi_T(h) \circ (\phi_t(h))^{-1}$; at occasions we shall also write $\phi^h_T(x_0)$ resp.
$\phi^h_{T \leftarrow t}(x_0)$ instead of $\phi_T(h)$ resp. $\phi_{T \leftarrow t}(h)$. We note that $\phi_{T \leftarrow t}(h)$ is a diffeomorphism, as function
of $x_0 \in \mathbb{R}^d$, and denote its differential by $\Phi_{T \leftarrow t}(h)$.

A well-known sufficient condition for $\mathcal{K}_a \ne \emptyset$ is the strong Hörmander condition

$\forall x \in \mathbb{R}^d : \mathrm{Lie}[\sigma_1, \dots, \sigma_m]\big|_x = T_x\mathbb{R}^d \cong \mathbb{R}^d$. (H1)



Whenever $\mathcal{K}_a \ne \emptyset$, it makes sense to define the energy and the set of minimizers

$\Lambda_{x_0,T}(a) \equiv \Lambda(a) := \inf\left\{\tfrac{1}{2}\|h\|^2_H : h \in \mathcal{K}_a\right\}$, (10)
$\mathcal{K}^{\min}_a := \left\{h_0 \in \mathcal{K}_a : \tfrac{1}{2}\|h_0\|^2_H = \Lambda(a)\right\}$.

*Due to the non-Markovianity of the problem, PDE techniques are poorly suited to study the density of $Y^\varepsilon_T$.
^In later applications to tail asymptotics, when $l = 1$, we have $a = 1$ and $\varepsilon := 1/y$ or $1/y^{1/2}$ as $y \uparrow \infty$. For this
reason we prefer to keep $a$ and $y$ at this stage separated.

''A weak Hörmander type condition which ensures $\mathcal{K}_a \ne \emptyset$ is found in [33].






In words, $\Lambda(a)$ is the minimal energy required to go in time $T$ from $x_0 \in \mathbb{R}^d$ to the "target"
submanifold

$N_a := \{x \in \mathbb{R}^d : \Pi_l(x) = (x^1, \dots, x^l) = a\}$.

Elements of $\mathcal{K}^{\min}_a$ will be called minimizers or minimizing controls. A standard weak-compactness
argument shows that $\mathcal{K}_a \ne \emptyset$ already implies that $\mathcal{K}^{\min}_a$ is non-empty. (Throughout the paper, we
shall only be concerned with the situation that $\mathcal{K}^{\min}_a$ contains one or finitely many minimizers.)

Following Bismut [14] it will be crucial that $\mathcal{K}_a$ enjoys a (Hilbert) manifold structure, locally
around (each) $h_0 \in \mathcal{K}^{\min}_a$. As is well-known, this can be guaranteed by assuming invertibility of
$C_{x_0,T;a}(h_0)$, the deterministic Malliavin matrix given by

$C_{x_0,T;a}(h) \equiv C(h) := \langle D\phi_T(h), D\phi_T(h) \rangle_H \in \mathrm{Lin}\left(T^*_{x_T}\mathbb{R}^d \to T_{x_T}\mathbb{R}^d\right)$, (11)

where $x_T := \phi_T(h)$ and $D$ will always denote the ($H$-valued) Fréchet derivative of some function
depending on $h \in H$. We can also view $C(h)$ as (positive semi-definite) quadratic form on $T^*_{x_T}\mathbb{R}^d$,
in coordinates

$\langle C(h)p, p \rangle = \|\langle D\phi_T(h), p \rangle\|^2_H$ where $p = p_i\,dx^i\big|_{x_T} \in T^*_{x_T}\mathbb{R}^d$.

In fact, large parts of our analysis only rely on non-degeneracy of $C(h)$ restricted to $\mathbb{R}^l$, but we
find it more convenient to deal with the "full" matrix $C(h_0)$. The following condition will cover
most of our applications.

Proposition 1 Assume $h \in H$ and

$\exists t \in [0,T] : \mathrm{span}[\sigma_1, \dots, \sigma_m]\big|_{x_t} = T_{x_t}\mathbb{R}^d$,

where $x_t := \phi_t(h)$. Then $C(h)$ is invertible.

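Proof. We have the well-known formula, for any $k \in H$,

$\langle D\phi_T(h), k \rangle_H = D\phi_T(h)[k] = \int_0^T \sum_{i=1}^m \Phi_{T \leftarrow t}(h)\, \sigma_i(x_t)\, \dot{k}^i_t\,dt \in T_{x_T}\mathbb{R}^d$.

When pairing this with $p = p_i\,dx^i \in T^*_{x_T}\mathbb{R}^d$, we have

$\langle \langle p, D\phi_T(h) \rangle, k \rangle_H = \int_0^T \sum_{i=1}^m \langle p, \Phi_{T \leftarrow t}(h)\, \sigma_i(x_t) \rangle\, \dot{k}^i_t\,dt$

and it easily follows that

$\|\langle p, D\phi_T(h) \rangle\|^2_H = \int_0^T \sum_{i=1}^m \langle p, \Phi_{T \leftarrow t}(h)\, \sigma_i(x_t) \rangle^2\,dt = \int_0^T \sum_{i=1}^m \langle (\Phi_{T \leftarrow t}(h))^* p, \sigma_i(x_t) \rangle^2\,dt$.

By assumption $\mathrm{span}[\sigma_1, \dots, \sigma_m]\big|_{x_t} = T_{x_t}\mathbb{R}^d$ for some $t \in [0,T]$, and this clearly remains valid in a
small enough open interval containing $t$. Hence, if $\langle C(h)p, p \rangle = 0$, the integrand above vanishes on
that interval, which is enough to conclude $(\Phi_{T \leftarrow t}(h))^* p = 0$. By non-
degeneracy of the (co-)tangent flow, this implies $p = 0$ and so $C(h)$ is non-degenerate, as claimed.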
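
The condition of Proposition 1 is sufficient but not necessary. As a hedged illustration (the model is an assumption chosen for its closed form, not an example from the text), take $d = 2$, $m = 1$ with $\sigma_0(x) = (x^2, 0)$ and $\sigma_1 = (0, 1)$, i.e. $dX^1 = X^2\,dt$, $dX^2 = dW$. Here $\mathrm{span}[\sigma_1]$ is one-dimensional everywhere, yet $\Phi_{T \leftarrow t}\sigma_1 = (T - t, 1)$ does not depend on $h$, and $C(h) = \int_0^T (\Phi_{T \leftarrow t}\sigma_1)(\Phi_{T \leftarrow t}\sigma_1)^T dt$ is invertible. The sketch below, with hypothetical function names, evaluates this integral with Simpson's rule.

```python
def transported_sigma(T, t):
    """Phi_{T<-t} sigma_1 = (T - t, 1) for the model dX^1 = X^2 dt, dX^2 = dW."""
    return (T - t, 1.0)

def malliavin_matrix(T, n=100):
    """Simpson approximation of C = int_0^T v(t) v(t)^T dt, v = Phi_{T<-t} sigma_1."""
    if n % 2:
        n += 1  # Simpson's rule needs an even number of subintervals
    dt = T / n
    C = [[0.0, 0.0], [0.0, 0.0]]
    for j in range(n + 1):
        # Simpson weights: 1 at the endpoints, 4 at odd nodes, 2 at even nodes
        w = 1.0 if j in (0, n) else (4.0 if j % 2 else 2.0)
        v = transported_sigma(T, j * dt)
        for a in range(2):
            for b in range(2):
                C[a][b] += w * v[a] * v[b] * dt / 3.0
    return C

def det2(M):
    """Determinant of a 2x2 matrix given as nested lists."""
    return M[0][0] * M[1][1] - M[0][1] * M[1][0]

C = malliavin_matrix(T=1.0)
```

For this model the exact matrix has entries $T^3/3$, $T^2/2$, $T$ and determinant $T^4/12 > 0$; invertibility comes from the drift, through a weak Hörmander type bracket, rather than from the span of $\sigma_1$ alone.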



°A sufficient condition for "$C(h)$ is invertible for every $h \ne 0$" in a strictly sub-elliptic setting is given as condition
(H2) by [14]; although much stronger than Hörmander's condition, it does apply to examples such as the 3-dimensional
Heisenberg group.






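Remark 2 (Tangent space of $\mathcal{K}_a$) Assume $C(h)$ is invertible. Then $\mathcal{K}_a$ enjoys a (Hilbert) man-
ifold structure, locally around $h$, and

$T_h\mathcal{K}_a = \ker D(\Pi_l \circ \phi_T)(h) =: H_0$.

Moreover,

$H_0 = \left\{k \in H : \int_0^T \sum_{i=1}^m \langle p, \Phi_{T \leftarrow t}(h)\, \sigma_i(x_t) \rangle\, \dot{k}^i_t\,dt = 0 \ \ \forall p \in \mathrm{span}\{dx^1, \dots, dx^l\}\big|_{x_T} \subset T^*_{x_T}\mathbb{R}^d\right\}$.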

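We now introduce the Hamiltonian

$\mathcal{H}(x,p) := \langle p, \sigma_0(x) \rangle + \frac{1}{2} \sum_{i=1}^m \langle p, \sigma_i(x) \rangle^2$
$= \langle p, \sigma_0(x) \rangle + \frac{1}{2} \langle p, (\sigma\sigma^T)(x)\, p \rangle$

and $H_{t \leftarrow 0} = H_{t \leftarrow 0}(x_0, p_0)$ as the flow associated to the vector field $(\partial_p \mathcal{H}, -\partial_x \mathcal{H})$ on $T^*\mathbb{R}^d$. (The
Hamiltonian $\mathcal{H}$ is invariant under this flow; it follows that $H_{t \leftarrow 0}$ does not explode.)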



Remark 3 Our setup here is tied to the SDE (4), driven by $m$ independent Brownians $W^1, \dots, W^m$.
Many stochastic models, notably in finance, are written in terms of correlated Brownians, i.e. with
a non-trivial correlation matrix $\Omega = (\omega^{i,j} : 1 \le i, j \le m)$, where $d\langle W^i, W^j \rangle_t = \omega^{i,j}\,dt$. The Hamil-
tonian then becomes

$\mathcal{H}(x,p) = \langle p, \sigma_0(x) \rangle + \frac{1}{2} \langle p, (\sigma \Omega \sigma^T)(x)\, p \rangle$.

The following propositions generalize the respective results in Bismut's book [14] (see also Ben
Arous [8, Theorems 1.15 and 1.1.8]) from a drift-free ($\sigma_0 = 0$), point-to-point setting ($x_0 \in \mathbb{R}^d$ to
$y \in \mathbb{R}^d$) to a point-to-subspace setting ($x_0 \in \mathbb{R}^d$ to $(y, \cdot) \in \mathbb{R}^l \oplus \mathbb{R}^{d-l}$) with drift vector field $\sigma_0$.
Note that the Bismut setting [14, Chapter I] is recovered by taking zero drift, $\sigma_0 = 0$, and $l = d$.



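Proposition 4 If (i) $h_0 \in \mathcal{K}^{\min}_y$ is a minimizing control and (ii) the deterministic Malliavin co-
variance matrix $C(h_0)$ is invertible then there exists a unique $p_0 = p_0(h_0) \in T^*_{x_0}\mathbb{R}^d$, in fact

$p_0 \in (\Phi_{T \leftarrow 0}(h_0))^*\, \mathrm{span}\{dx^1, \dots, dx^l\}\big|_{\phi^{h_0}_T(x_0)}$,

such that

$\phi^{h_0}_t(x_0) = \pi H_{t \leftarrow 0}(x_0, p_0)$, $0 \le t \le T$ (12)

($\pi$ denotes the projection from $T^*\mathbb{R}^d$ onto $\mathbb{R}^d$; in coordinates $\pi(x,p) = x$).

Moreover, $(x(t), p(t)) := H_{t \leftarrow 0}(x_0, p_0)$ solves the Hamiltonian ODEs in $T^*\mathbb{R}^d \cong \mathbb{R}^d \oplus \mathbb{R}^d$

$\begin{pmatrix} \dot{x} \\ \dot{p} \end{pmatrix} = \begin{pmatrix} \partial_p \mathcal{H}(x(t), p(t)) \\ -\partial_x \mathcal{H}(x(t), p(t)) \end{pmatrix}$; (13)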



^The (global) coordinate chart $(x^1, \dots, x^d)$ of $\mathbb{R}^d$ induces coordinate co-vector fields (or one-forms)
$(dx^1, \dots, dx^d)$.
the minimizing control $h_0 = h_0(\cdot)$ is recovered by

$\dot{h}_0 = \begin{pmatrix} \langle \sigma_1(x(\cdot)), p(\cdot) \rangle \\ \vdots \\ \langle \sigma_m(x(\cdot)), p(\cdot) \rangle \end{pmatrix}$ (14)

and, with $c := \mathcal{H}(x(t), p(t))$ independent of $t \in [0,T]$,

$\Lambda(y) = \frac{1}{2}\|h_0\|^2_H = Tc - \int_0^T \langle \sigma_0(x(t)), p(t) \rangle\,dt$. (15)

At last, crucial for actual computations, $(x(t), p(t)) = H_{t \leftarrow 0}(x_0, p_0)$ satisfies the Hamiltonian ODEs
as a boundary value problem, subject to the following initial, terminal and transversality
conditions,

$x(0) = x_0 \in \mathbb{R}^d$,
$x(T) = (y, \cdot) \in \mathbb{R}^l \oplus \mathbb{R}^{d-l}$,
$p(T) = (\cdot, 0) \in \mathbb{R}^l \oplus \mathbb{R}^{d-l}$. (16)

Proof. The key remark, due to Bismut [14, Chapter I], is that under the assumption "$\exists\, C(h_0)^{-1}$"
the set $\mathcal{K}^{\min}_y$ can be described by Hamilton-Jacobi theory. It then suffices to adapt the arguments of
Bismut, as done in the drift-free case by Takanobu-Watanabe [49]. Let us note that the additional
drift vector field $\sigma_0$ is trivially incorporated in their setting by adding a 0th component to the
controls, i.e. $h^0(t) \equiv t$. The boundary conditions, in particular transversality, have not been
pointed out explicitly in [49] although they are implicitly contained in their formulation. In fact, formal
application of Pontryagin's maximum principle leads precisely to the above boundary value problem;
care is necessary, however, since without assuming invertibility of $C(h_0)$, one can be in the so-called
"strictly abnormal" case; the above approach is then not possible. ■

Remark 5 Assume there exists a (smooth) map $(a - \epsilon, a + \epsilon) \ni y \mapsto h_0(y) \in \mathcal{K}^{\min}_y$. Then

$\Lambda(y) = \frac{1}{2}\|h_0(y)\|^2_H \implies \partial_y \Lambda(a) = \langle h_0(a), \partial_y h_0(a) \rangle_H$.

On the other hand, we know from $\Pi_l \phi_T(h_0(y)) = y$ that

$(\Pi_l)_* D\phi_T(h_0(a))[\partial_y h_0(a)] = \mathrm{Id}$ on $\mathbb{R}^l$.

Using $h_0(a) = D\phi_T(h_0(a))^* p(T) = D\phi_T(h_0(a))^* (\Pi_l)^* q(a)$, where $(\Pi_l)^* q(a) = (q(a), 0)$, we
have

$\langle h_0(a), \partial_y h_0(a) \rangle_H = \langle q(a), (\Pi_l)_* D\phi_T(h_0(a))\, \partial_y h_0(a) \rangle$.

It thus follows that the derivatives of the energy are given in terms of $q(a)$,

$\partial_y \Lambda(a) = \begin{pmatrix} q_1(a) \\ \vdots \\ q_l(a) \end{pmatrix}$. (17)

This can be a useful shortcut when computing the energy from the Hamiltonian system. If $\#\mathcal{K}^{\min}_a = 1$
for some $a$, and our non-degeneracy condition (ND) as introduced below is met, the existence of
such a map $h_0(\cdot)$ can be shown along the lines of [14, Thm 1.26]. We shall not rely on formula
(17) in the sequel, although we will find it confirmed in various examples.
Remark 6 (How to compute optimal controls $h_0$) Proposition 4 - as it stands - requires $h_0$
to be a minimizer and then, subject to condition (ii), provides us with some information about
$\phi^{h_0}_\cdot(x_0)$, $(x, p) = H_{\cdot \leftarrow 0}(x_0, p_0)$ and in particular allows us to reconstruct $h_0$ from the Hamiltonian
flow $(x, p)$, cf. equation (14). That said, we can consider any solution to the boundary value
problem (13), (16), say $(x, p)$, and define a (possibly non-minimizing) control path $h_0$ via (14), i.e.

$h^i_0(\cdot) = \int_0^{\cdot} \langle \sigma_i(x(t)), p(t) \rangle\,dt$, $i = 1, \dots, m$.

From (13),

$dx_t = \partial_p \mathcal{H}(x_t, p_t)\,dt = \sigma_0(x_t)\,dt + \sum_{i=1}^m \sigma_i(x_t) \langle \sigma_i(x_t), p_t \rangle\,dt$

and so relation (12) remains valid, i.e. $\phi^{h_0}_t(x_0) = x_t$. It follows that the boundary conditions valid
for $x$ (namely, $x(0) = x_0$, $\Pi_l x(T) = y$) are also valid for $\phi^{h_0}_\cdot(x_0)$ and hence $h_0 \in \mathcal{K}_y$. While we do not
know if $h_0 \in \mathcal{K}^{\min}_y$, proposition 4 guarantees that every minimizer $h_0 \in \mathcal{K}^{\min}_y$ can be found by the above
procedure. We thus have the following recipe:

(i) Argue a priori that $C(h_0)$ is invertible (or ignore and check in the end).

(ii) Solve the Hamiltonian ODEs as boundary value problem, cf. (13), (16). Characterize all solutions
via the (non-empty!) set

$\{p_0 : H_{t \leftarrow 0}(x_0, p_0) = (x_t, p_t)$ satisfies (13), (16)$\}$;

(iii) For each such $p_0$, compute $\|h_0\|^2_H$ where $h_0$ is given by

$h^i_0 = \int_0^{\cdot} \langle \sigma_i(x_t), p_t \rangle\,dt$, $i = 1, \dots, m$;

(iv) The minimizing $h_0$ are precisely those elements in $\{h_0$ : as constructed in (ii), (iii)$\}$ which
minimize the energy $\|h_0\|^2_H$. In particular then,

$\Lambda(y) = \frac{1}{2}\|h_0\|^2_H$.
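
The recipe (i)-(iv) can be sketched numerically on a toy problem. The example below is an assumption made for illustration, not a model treated in the text: $d = 2$, $l = 1$, $m = 1$ with $dX^1 = X^2\,dt$, $dX^2 = dW$, so that $\mathcal{H}(x,p) = p_1 x^2 + \frac{1}{2} p_2^2$. We integrate the Hamiltonian ODEs with a hand-written RK4 scheme and find $p_0$ by Newton shooting on the terminal and transversality conditions $x^1(T) = y$, $p_2(T) = 0$; all function names are hypothetical. For this linear example the boundary value problem has a unique solution, so step (iv) is trivial, and the energy can be compared with the closed form $3y^2/(2T^3)$ (the law of $\int_0^T W_s\,ds$ is Gaussian with variance $T^3/3$).

```python
def hamiltonian_rhs(state):
    """Hamiltonian ODEs for H(x, p) = p1*x2 + 0.5*p2**2, plus the running energy."""
    x1, x2, p1, p2, energy = state
    u = p2  # derivative of the control, <sigma_1(x), p> with sigma_1 = (0, 1)
    return [x2, u, 0.0, -p1, 0.5 * u * u]

def flow(x0, p0, T, steps=200):
    """RK4 integration of (x, p, energy) from time 0 to T."""
    state = [x0[0], x0[1], p0[0], p0[1], 0.0]
    dt = T / steps
    for _ in range(steps):
        k1 = hamiltonian_rhs(state)
        k2 = hamiltonian_rhs([s + 0.5 * dt * k for s, k in zip(state, k1)])
        k3 = hamiltonian_rhs([s + 0.5 * dt * k for s, k in zip(state, k2)])
        k4 = hamiltonian_rhs([s + dt * k for s, k in zip(state, k3)])
        state = [s + dt * (a + 2 * b + 2 * c + d) / 6.0
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4)]
    return state

def residual(p0, x0, y, T):
    """Boundary conditions (16): x1(T) - y = 0 and p2(T) = 0."""
    x1, _, _, p2, _ = flow(x0, p0, T)
    return [x1 - y, p2]

def shoot(x0, y, T, p0=(0.0, 0.0), tol=1e-12, max_iter=20, eps=1e-6):
    """Newton iteration on p0 with a finite-difference 2x2 Jacobian."""
    p = list(p0)
    for _ in range(max_iter):
        r = residual(p, x0, y, T)
        if max(abs(v) for v in r) < tol:
            break
        cols = []
        for j in range(2):
            q = list(p)
            q[j] += eps
            rj = residual(q, x0, y, T)
            cols.append([(rj[i] - r[i]) / eps for i in range(2)])
        a, c = cols[0]  # derivatives of (r0, r1) with respect to p[0]
        b, d = cols[1]  # derivatives of (r0, r1) with respect to p[1]
        det = a * d - b * c
        # Newton step: solve J * delta = -r with J = [[a, b], [c, d]]
        p[0] += (-r[0] * d + r[1] * b) / det
        p[1] += (-a * r[1] + c * r[0]) / det
    return p

x0, y, T = (0.0, 0.0), 1.0, 1.0
p0_star = shoot(x0, y, T)
energy = flow(x0, p0_star, T)[4]
```

In this example the first component of $p_0$ is constant along the flow and equals $3y/T^3$, which is also the derivative of $3y^2/(2T^3)$ in $y$, in line with the relation between $\partial_y \Lambda$ and $q$ in Remark 5. For non-linear models the shooting map may have several roots, and each of them has to be compared in energy as in step (iv).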

The following proposition is crucial.

Proposition 7 Under the assumptions of proposition 4, in particular $h_0 \in \mathcal{K}^{\min}_a$ with associated
$p_0 = p_0(h_0) \in T^*_{x_0}\mathbb{R}^d$, the following are equivalent:

(iii) $h_0 \in \mathcal{K}_a$ is a non-degenerate minimum of the energy $I := \frac{1}{2}\|\cdot\|^2_H$ restricted to the Hilbert
manifold $\mathcal{K}_a$; i.e.

$I''(h_0)[k, k] > 0 \quad \forall\, 0 \ne k \in H_0 = T_{h_0}\mathcal{K}_a$;

(iii') $x_0$ is non-focal for $N = (a, \cdot)$ along $h_0$ in the sense that, with $(x_T, p_T) := H_{T \leftarrow 0}(x_0, p_0(h_0)) \in T^*\mathbb{R}^d$,

$\partial_{(q,z)}\big|_{(q,z) = (0,0)}\, \pi H_{0 \leftarrow T}\left(x_T + (0, z),\, p_T + (q, 0)\right)$

is non-degenerate (as $d \times d$ matrix; here we think of $(q, z) \in \mathbb{R}^l \times \mathbb{R}^{d-l} \cong \mathbb{R}^d$ and recall that $\pi$
denotes the projection from $T^*\mathbb{R}^d$ onto $\mathbb{R}^d$; in coordinates $\pi(x,p) = x$).
Proof. Let us give a quick proof of (iii') $\Rightarrow$ (iii) in the Riemannian setting; the general (sub-
Riemannian, with drift) case is new and a full proof is given in the next section. Since $h_0 \in \mathcal{K}^{\min}_a$
we know that $I''(h_0)$ must be positive semi-definite. In particular, the index of $h_0$, relative to the
point-submanifold problem $x_0 \times N$, is zero. By the Morse index theorem [Ml [12], there cannot be
any focal point along the $(x_0 \times N)$-geodesic

$\{\pi H_{t \leftarrow T}(x_T, p_T) : t \in (0, T]\}$.

Condition (iii') guarantees that this extends to $t = 0$, i.e. there is no focal point along

$\{\pi H_{t \leftarrow T}(x_T, p_T) : t \in [0, T]\}$.

We can then use [15, lemma 2.9 (b)] to conclude that $I''(h_0)$ is positive definite. ■

Definition 8 (Condition (ND); generalized $\notin$ cut-locus condition) We say that $\{x_0\} \times N_a$,
where $N_a := (a, \cdot) := \{x \in \mathbb{R}^d : \Pi_l x = a \in \mathbb{R}^l\}$, satisfies condition (ND) if

(i) $1 \le \#\mathcal{K}^{\min}_a < \infty$,

(ii) the deterministic Malliavin covariance matrix $C(h)$ is invertible, $\forall h \in \mathcal{K}^{\min}_a$,

(iii) $x_0$ is not focal for $N_a$ along $h$, for any $h \in \mathcal{K}^{\min}_a$.

When $\sigma_0 = 0$ and $l = d$, i.e. $N = \{y\}$, and $\#\mathcal{K}^{\min}_y = 1$, condition (ND) says precisely that
$(x_0, y)$ is not contained in the sub-Riemannian cut-locus in the sense of Ben Arous [8], extending the
usual Riemannian meaning. In this sense our (global) condition (ND) is effectively a generalization
of the well-known "$\notin$ cut-locus" condition in the context of heat-kernel expansions.

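Theorem 9 (Small noise) Let $(X^\varepsilon)$ be the solution process to

$dX^\varepsilon_t = b(\varepsilon, X^\varepsilon_t)\,dt + \varepsilon \sigma(X^\varepsilon_t)\,dW_t$, with $X^\varepsilon_0 = x^\varepsilon_0 \in \mathbb{R}^d$.

Assume $b(\varepsilon, \cdot) \to \sigma_0(\cdot)$ in the sense of (5), (6), and $X^\varepsilon_0 \equiv x^\varepsilon_0 \to x_0$ as $\varepsilon \to 0$ in the sense of (7).

Assume the weak Hörmander condition (H) at $x_0 \in \mathbb{R}^d$. Fix $y \in \mathbb{R}^l$, $N_y := (y, \cdot)$ and assume that
$\{x_0\} \times N_y$ satisfies (ND), i.e. the generalized $\notin$ cut-locus condition (in particular then, $\#\mathcal{K}^{\min}_y \ge 1$).
Then the energy

$\Lambda(y) = \inf\left\{\tfrac{1}{2}\|h\|^2_H : h \in \mathcal{K}_y\right\} = \tfrac{1}{2}\|h_0\|^2_H$

is smooth in a neighbourhood of $y$ provided $\#\mathcal{K}^{\min}_y = 1$; otherwise, i.e. when $\#\mathcal{K}^{\min}_y > 1$, we assume
so.

Fix $x_0$, $y$ and $T > 0$. Then there exists $c_0 = c_0(x_0, y, T) > 0$ such that

$Y^\varepsilon_T = \Pi_l X^\varepsilon_T = (X^{\varepsilon,1}_T, \dots, X^{\varepsilon,l}_T)$, $1 \le l \le d$
admits a density with expansion

$f^\varepsilon(y, T) = e^{-\Lambda(y)/\varepsilon^2}\, e^{\max\{\Lambda'(y) \cdot \hat{Y}_T(h_0) \,:\, h_0 \in \mathcal{K}^{\min}_y\}/\varepsilon}\, \varepsilon^{-l}\, (c_0 + O(\varepsilon))$ as $\varepsilon \downarrow 0$.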



^It will not be true in general, when $\#\mathcal{K}^{\min}_y > 1$, that $\Lambda(y)$ is automatically smooth near $y$. To wit, consider
$\mathcal{K}^{\min}_y = \{h_0(y), \tilde{h}_0(y)\}$. Then $\Lambda(y) = \min\left(\tfrac{1}{2}\|h_0(y)\|^2_H, \tfrac{1}{2}\|\tilde{h}_0(y)\|^2_H\right)$ and even if $\|h_0(\cdot)\|^2_H$ and $\|\tilde{h}_0(\cdot)\|^2_H$ are
smooth near $y$, this need not be the case for the minimum.
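Here $\hat{Y} = \hat{Y}(h_0) = (\hat{Y}^1, \dots, \hat{Y}^l)$ is the projection, $\hat{Y} = \Pi_l \hat{X}$, of the solution to the following (ordi-
nary) differential equation

$d\hat{X}_t = \left(\partial_x b\left(0, \phi^{h_0}_t(x_0)\right) + \partial_x \sigma\left(\phi^{h_0}_t(x_0)\right) \dot{h}_0(t)\right) \hat{X}_t\,dt + \partial_\varepsilon b\left(0, \phi^{h_0}_t(x_0)\right) dt$, (18)

$\hat{X}_0 = \partial_\varepsilon\big|_{\varepsilon = 0}\, x^\varepsilon_0$.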

Remark 10 The assumption $\mathcal{K}_y \ne \emptyset$, implicit through $\#\mathcal{K}^{\min}_y \ge 1$ in the statement of the above
theorem, is known to be necessary for the existence of a positive density; in presence of (H) and
invertibility of $C(h)$ for some $h \in \mathcal{K}_y$, it is actually sufficient; \1 If . The strong Hörmander condition
at all points (H1) is well-known to ensure $\mathcal{K}_y \ne \emptyset$; a less well-known and subtle condition of
weak-Hörmander type is given in [33].

Remark 11 When applied to small time expansions, the weak Hörmander condition in the above
theorem automatically reduces to the strong Hörmander condition at $x_0$; indeed, the "drift" vector
field in the weak condition will be the limit of $\varepsilon^2$ times the original drift vector fields; plainly this is
zero and therefore does not figure in the span.

Proof. Assume $\#\mathcal{K}^{\min}_y = 1$ and see remark 12 below for the reduction of $\#\mathcal{K}^{\min}_y < \infty$ to this case.
The basic remark is that $f^\varepsilon(y, T)$ is the Fourier inverse of its characteristic function,

$E[\exp(i \xi \cdot Y^\varepsilon_T)] = E[\exp(i (\xi, 0) \cdot X^\varepsilon_T)]$

where we write $(\xi, 0) = (\xi^1, \dots, \xi^l, 0, \dots, 0) \in \mathbb{R}^d$. In other words, it suffices to restrict the c.f.
of $X^\varepsilon_T$, the full (Markovian) process evaluated at time $T$, to obtain the c.f. of $Y^\varepsilon_T$. The density is
then obtained by Fourier inversion. When $X^\varepsilon_T$ is affine the c.f. is analytically described by ODEs;
(approximate) saddle points are easy to compute and the Fourier inversion - after shifting the
contour through the saddle point - becomes a finite-dimensional Laplace method which leads to the
desired expansion of $f^\varepsilon(y, T)$; in essence, this approach was carried out by Friz et al. in [24]. In
our present situation, of course, $X$ does not enjoy any affine structure, but - following Ben Arous
[8], who considers the "point-to-point" case $l = d$ - a similar approach works and ultimately boils
down to the Laplace method on Wiener space [9]. The differences to the setting of [8], aside from (i)
allowing for $l < d$, are that (ii) our drift term does not vanish of order $\varepsilon^2$ (which is typical when
aiming for short time asymptotics; cf. also proposition [TSl below) and (iii) that the starting point is
allowed to depend on $\varepsilon$. In fact, (ii), (iii) are responsible for the additional exponential factor $\exp\{(\dots)/\varepsilon\}$
in our expansion. (Such a factor was already seen in the general context of the Laplace method
on Wiener space [9].) Also, (ii) implies that the limiting vector field $\sigma_0 = \lim_{\varepsilon \to 0} b(\varepsilon, \cdot)$ affects the
leading order behaviour in that the energy $\Lambda(y)$ has no geometric interpretation as square of some
(sub)Riemannian point-subspace distance. In particular, if we want to implement the strategy of
[5] we are forced to revisit the meaning of all geometric concepts (cut-locus, geodesics, conjugate
points ...) upon which the work [8] is based. The key observation is that essentially all geometric
concepts channel through the (non-geometric, but infinite-dimensional) condition (iii) of proposition
7 into the application of Laplace's method. Now, the whole point of proposition 7 was to provide
checkable conditions for $x_0, y$ to satisfy (iii). Having made these part of our assumptions we are in
fact ready to proceed along the lines of Ben Arous [8].
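Fix $y$ and note that for any $C^\infty$-bounded function $z \mapsto F(z)$ on $\mathbb{R}^l$, by Fourier inversion,

$f^\varepsilon(y, T)\, e^{-F(y)/\varepsilon^2} = \frac{1}{(2\pi)^l} \int_{\mathbb{R}^l} E\left[\exp\left(i \xi \cdot (Y^\varepsilon_T - y) - \frac{F(Y^\varepsilon_T)}{\varepsilon^2}\right)\right] d\xi$
$= \frac{1}{(2\pi\varepsilon)^l} \int_{\mathbb{R}^l} E\left[\exp\left(i \zeta \cdot \frac{Y^\varepsilon_T - y}{\varepsilon}\right) \exp\left(-\frac{F(Y^\varepsilon_T)}{\varepsilon^2}\right)\right] d\zeta$ (19)
$= \frac{1}{(2\pi\varepsilon)^l} \int_{\mathbb{R}^l} E\left[\exp\left(i (\zeta, 0) \cdot \frac{X^\varepsilon_T - (y, 0)}{\varepsilon}\right) \exp\left(-\frac{F(\Pi_l X^\varepsilon_T)}{\varepsilon^2}\right)\right] d\zeta$. (20)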


In particular, the last integrand can be computed, as asymptotic expansion in $\varepsilon$ for fixed $\zeta$, by
the Laplace method in Wiener space, cf. [5], [8], based on the full (Markovian) process $X^\varepsilon$. We pick $F$
(for fixed $y$) such that $F(\cdot) + \Lambda(\cdot)$ has minimum at $y$, i.e.

$\Lambda(y) = \inf\{F(z) + \Lambda(z) : z \in \mathbb{R}^l\}$

and such that this minimum is non-degenerate; a natural candidate for $F(z)$ would then be given
(at least for $z$ near $y$) by

$z \mapsto \lambda |z - y|^2 - \Lambda(z)$, some $\lambda > 0$;
or $z \mapsto \lambda |z - y|^2 - [\Lambda(z) - \Lambda(y)]$,

since adding constants is irrelevant here (recall that $y$ is kept fixed). The trouble with the above
candidates is the potential lack of (global) smoothness of $\Lambda$; even in the classical Riemannian
setting $\Lambda$ will not be smooth at the cut-locus. On the other hand, $\Lambda(\cdot)$ is smooth near $y$ in case
$\#\mathcal{K}^{\min}_y = 1$; this is seen exactly as in [14, Thm 1.26]. (In the case $1 < \#\mathcal{K}^{\min}_y$, smoothness of $\Lambda(\cdot)$
near $y$ was in fact part of our assumptions.) It is thus natural to localize the above candidates
around $y$ which leads us to define $F$, at least in a neighbourhood of $y$, by



$F(z) = \lambda |z - y|^2 - \left[\partial_y \Lambda(y)\,(z - y) + \frac{1}{2}\, \partial^2_y \Lambda(y)\,(z - y, z - y)\right]$;

a routine modification of $F$, away from $y$, then guarantees $C^\infty$-boundedness of $F$. (Since $F(y) = 0$
with this last choice of $F$, the l.h.s. of (19) is actually precisely $f^\varepsilon(y, T)$.) Non-degeneracy of
the minimum $y$ of $F + \Lambda$ entails that the functional $H \ni h \mapsto F\left(\Pi_l \phi^h_T(x_0)\right) + \frac{1}{2}\|h\|^2_H$ has a non-
degenerate minimum at $h_0 \in H$. (The argument is identical to [8, Thm 2.6] and makes crucial use
of proposition 7.) The Laplace method is then applicable: we replace $\varepsilon\,dW$ by $\varepsilon\,dW + dh_0$ in (4)
and call the resulting diffusion process $Z^\varepsilon$. The integrand of (20) can then be expressed in terms
with $X^\varepsilon$ replaced by $Z^\varepsilon$; of course at the price of including the Girsanov factor



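$$\mathcal{G}:=\exp\left(-\frac1\varepsilon\int_0^T\dot h_0(t)\,dW_t-\frac{1}{2\varepsilon^2}\int_0^T|\dot h_0(t)|^2\,dt\right)=\exp\left(-\frac1\varepsilon\int_0^T\dot h_0(t)\,dW_t-\frac{1}{\varepsilon^2}\Lambda(y)\right).$$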



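A stochastic Taylor expansion of $F(\Pi_lZ_T^\varepsilon)$, noting right away that

$$F(\Pi_lZ_T^\varepsilon)\big|_{\varepsilon=0}=F\left(\Pi_l\phi_T^{h_0}(x_0)\right)=F(y)=0,$$

then leads to (cf. [HI Lemme 1.43])

$$
\begin{aligned}
\exp\left(-\frac{1}{\varepsilon^2}F(\Pi_lZ_T^\varepsilon)\right)
&=\exp\left[-\frac{1}{\varepsilon^2}\left(F(y)-\varepsilon\int_0^T\dot h_0(t)\,dW_t-\varepsilon\,\hat Y_T\cdot\partial_y\Lambda(y)+O(\varepsilon^2)\right)\right]\\
&=\exp\left[\frac1\varepsilon\int_0^T\dot h_0(t)\,dW_t+\frac1\varepsilon\,\hat Y_T\cdot\partial_y\Lambda(y)+O(1)\right]. \qquad (21)
\end{aligned}
$$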



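Putting things together, we have, using $F(y)=0$, and noting cancellation of $\int_0^T\dot h_0(t)\,dW_t$ in (21) with the identical term in the Girsanov factor $\mathcal{G}$,

$$
\begin{aligned}
f^\varepsilon(y,T)
&=\frac{1}{(2\pi\varepsilon)^l}\int_{\mathbb{R}^l}E\left[\mathcal{G}\times\exp\left(i(\zeta,0)\cdot\frac{Z_T^\varepsilon-(y,0)}{\varepsilon}\right)e^{-F(\Pi_lZ_T^\varepsilon)/\varepsilon^2}\right]d\zeta\\
&=\varepsilon^{-l}\exp\left(-\frac{\Lambda(y)}{\varepsilon^2}\right)\exp\left(\frac1\varepsilon\,\hat Y_T\cdot\partial_y\Lambda(y)\right)\frac{1}{(2\pi)^l}\int_{\mathbb{R}^l}E\left[\exp\left(i(\zeta,0)\cdot\frac{Z_T^\varepsilon-(y,0)}{\varepsilon}\right)\exp\left(O(1)\right)\right]d\zeta \qquad (22)
\end{aligned}
$$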



where $O(1)$ denotes the term, bounded as $\varepsilon\downarrow0$, from (21). What is left to show, of course, is that $c_0$, i.e. the final factor in the above expression, is indeed a strictly positive and finite real number. But since our analysis is based on the full Markovian process $X^\varepsilon$ (resp. $Z^\varepsilon$ after change of measure), the arguments of [8, Lemme (3.25)] apply with essentially no changes. In particular, one uses large deviations as in [8, Lemme (3.25)] and, crucially, non-degeneracy of the minimizer $h_0\in H$, guaranteed by proposition 7. Finally, integrating the asymptotic expansion with respect to $\zeta\in\mathbb{R}^l$ is justified using the estimates of [H Lemme 3.48], obtained using Malliavin calculus techniques. At last one sees $c_0>0$, as in [8, p. 330]. ■

Remark 12 (Finitely many multiple minimizers) The case $1<\#\mathcal{K}^{\min}<\infty$, i.e. $\#\mathcal{K}^{\min}\in\{2,3,\dots\}$, is handled as in [9]. If $\mathcal{K}^{\min}$ consists of $n$ minimizers and invertibility of the Malliavin matrix as well as non-focality holds along each of these, the expansion for $f^\varepsilon(y,T)$ as given in theorem 9 remains valid. Indeed, after localization around each of these $n$ minimizers,



fAy,T)=\ Yl e-^e "."'°' £-'co(ho)| (l + 0(e)) 



^ (const) e e 
where $\hat Y_T(h_0)$ denotes the solution of (18).
Remark 13 (Localization) The assumptions on the coefficients $b,\sigma$ in theorem 9 (smooth, bounded with bounded derivatives of all orders) are typical in this context (cf. Ben Arous for instance) but rarely met in practical examples from finance. This difficulty can be resolved by a suitable localization which we now outline. Set $\tau_R:=\inf\{t\in[0,T]:\sup_{s\in[0,t]}|X_s^\varepsilon|\ge R\}$ and assume

$$P[\tau_R\le T]\lesssim e^{-J_R/\varepsilon^2}\quad\text{as }\varepsilon\downarrow0$$

with $J_R\to\infty$ as $R\to\infty$; by this we mean, more precisely,

$$\lim_{R\to\infty}\limsup_{\varepsilon\to0}\varepsilon^2\log P[\tau_R\le T]=-\infty. \qquad (23)$$

In that case, we can pick $R$ large enough so that $\Lambda(y)<J_R$, uniformly for $\varepsilon$ near $0+$, and can expect that the behaviour beyond some big ball of radius $R$ will not influence the expansion. In particular, if the coefficients $b,\sigma$ are smooth, but fail to be bounded resp. have bounded derivatives, we can modify them outside a ball of radius $R$ such as to have this property; call $\tilde b,\tilde\sigma$ these new coefficients and $\tilde X^\varepsilon$ the associated diffusion. To illustrate the localization, consider $l=1$, i.e. the distribution function for $Y_T^\varepsilon=X_T^{\varepsilon,1}$. Clearly, one has the two-sided estimates

$$P[Y_T^\varepsilon\ge y;\ \tau_R>T]\le P[Y_T^\varepsilon\ge y]\le P[Y_T^\varepsilon\ge y;\ \tau_R>T]+P[\tau_R\le T],$$

and similar for $\tilde Y_T^\varepsilon=\tilde X_T^{\varepsilon,1}$. Since $P[Y_T^\varepsilon\ge y;\ \tau_R>T]=P[\tilde Y_T^\varepsilon\ge y;\ \tau_R>T]$ it then follows

$$\left|P[Y_T^\varepsilon\ge y]-P[\tilde Y_T^\varepsilon\ge y]\right|\le P[\tau_R\le T]\lesssim e^{-J_R/\varepsilon^2}.$$

In particular, any expansion for $\tilde Y_T^\varepsilon$ of the form

$$P[\tilde Y_T^\varepsilon\ge y]=e^{-c_1/\varepsilon^2}e^{c_2/\varepsilon}\varepsilon^{-1}c_0\left(1+O(\varepsilon)\right)$$

leads, upon taking $R$ large enough so that $J_R>c_1$, to the same expansion for $P[Y_T^\varepsilon\ge y]$. With more work of routine type, this localization can also be employed for the density expansion in theorem 9.

2.1 Corollary on tail expansions 

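We have the following application to tail behaviour of, say, the first component (i.e. $l=1$ here) of a diffusion process at a fixed time $T$. The scaling assumption below is met in a number of stochastic volatility models.

Corollary 14 (Tail behaviour) Assume $x_0^\varepsilon\to0\in\mathbb{R}^d$ as $\varepsilon\to0$ and some diffusion process $X^\varepsilon$, started at $x_0^\varepsilon$, satisfies the assumptions of theorem 9 with $x_0=0$ and $N=(1,\cdot)$. In particular, $\{0\}\times(1,\cdot)$ is assumed to satisfy condition (ND). Assume also $\theta$-scaling by which we mean the scaling relation

$$Y_T^\varepsilon\overset{\text{law}}{=}\varepsilon^\theta Y_T\quad\text{where }Y=\Pi_1X$$

for some $\theta\ge1$. Then the probability density function of $Y_T$ has the expansion

$$f(y)=e^{-c_1y^{2/\theta}}e^{c_2y^{1/\theta}}y^{1/\theta-1}\left(c_0+O\left(1/y^{1/\theta}\right)\right)\quad\text{as }y\to\infty \qquad (24)$$

where

$$c_1=\Lambda(1),\qquad c_2=\hat Y_T\,\Lambda'(1)=\frac{2\hat Y_T}{\theta}\Lambda(1).$$

In particular, when $\theta=1$ we have a Gaussian tail behaviour of the precise form

$$f(y)=e^{-c_1y^2}e^{c_2y}\left(c_0+O(1/y)\right)$$

while $\theta=2$ leads to the exponential tail of the precise form

$$f(y)=e^{-c_1y}e^{c_2\sqrt y}\,y^{-1/2}\left(c_0+O(1/\sqrt y)\right).$$

Proof. Let $f^\varepsilon$ denote the density of $Y_T^\varepsilon$. Since $f(y/\varepsilon^\theta)=\varepsilon^\theta f^\varepsilon(y)$, evaluating at $y=1$ gives $f(\varepsilon^{-\theta})=\varepsilon^\theta f^\varepsilon(1)$; we can thus apply theorem 9 at the point $1$ and then substitute $\varepsilon=y^{-1/\theta}$. Another observation is that the assumed scaling implies

$$\Lambda_0(y)=y^{2/\theta}\Lambda_0(1)$$

and hence $\Lambda_0'(1)=\frac{2}{\theta}\Lambda_0(1)$. The rest is obvious. ■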
2.2 Corollary on short time expansions 

Finally, we have the following application to short time asymptotics. Note that for $l<d$, the projection of $X$ is non-Markovian and there is no Fokker-Planck equation that describes the evolution of $f$. In particular, there is no direct PDE approach that leads to the expansion below.

Corollary 15 (Short time) Consider $dX_t=b(X_t)\,dt+\sigma(X_t)\,dW$, started at $X_0=x_0\in\mathbb{R}^d$, with $C^\infty$-bounded vector fields such that the strong Hörmander condition holds,

$$\forall x\in\mathbb{R}^d:\quad\operatorname{Lie}[\sigma_1,\dots,\sigma_m]\big|_x=T_x\mathbb{R}^d. \qquad (\mathrm{H1})$$

For fixed $l\in\{1,\dots,d\}$ assume $\{x_0\}\times N_y$, where $N_y:=(y,\cdot)$ for some $y\in\mathbb{R}^l$, satisfies condition (ND). Let $f(t,\cdot)=f(t,y)$ be the density of $Y_t=(X_t^1,\dots,X_t^l)$. Then

$$f(t,y)\sim(\text{const})\,\frac{1}{t^{l/2}}\exp\left(-\frac{d^2(x_0,y)}{2t}\right)\quad\text{as }t\downarrow0$$

where $d(x_0,y)$ is the sub-Riemannian distance, based on $(\sigma_1,\dots,\sigma_m)$, from the point $x_0$ to the affine subspace $N_y$.

Proof. After Brownian scaling, we apply the theorem with $T=1$, $\varepsilon^2=t$ so that

$$b(\varepsilon,\cdot)=\varepsilon^2b(\cdot)\to\sigma_0(\cdot)\equiv0;$$

which explains why there is no drift vector field in the present Hörmander condition (H1). Also $x_0^\varepsilon=x_0$ here. The identification of the energy with 1/2 times the square of the sub-Riemannian (or: control-, Carnot-Carathéodory-) distance from $x_0$ to $N_y$ is classical. At last, the unique ODE solution to (18) is then given by $\hat Y\equiv0$ and there is no $\exp\{(\dots)/\varepsilon\}$ factor. ■
3 Non-focality and infinite-dimensional non-degeneracy

In the present section only, we write $h$ (rather than $h_0$) for a fixed element in $\mathcal{K}^{\min}$. Recall $T_h\mathcal{K}_{\mathrm a}=\ker D(\Pi_l\phi_T)(h)=:H_0$. Since $h\in\mathcal{K}^{\min}$, it is critical in the sense that

$$I'(h)=DI(h)=0\quad\text{on }T_h\mathcal{K}_{\mathrm a}=H_0.$$

Also recall $\phi_t^h=\phi^h_{t\leftarrow0}(x_0)=\phi_t(h)$, notation used when $x_0$ is fixed. Given

$$q\in\operatorname{span}\left\{dx^1|_{x_T},\dots,dx^l|_{x_T}\right\}$$

with $1\le l\le d$ we shall write

$$(q,0)\in\operatorname{span}\left\{dx^1|_{x_T},\dots,dx^d|_{x_T}\right\}$$

for q "viewed" as element in T^^M!^- We can describe as the set of those k — (fc^, . . . , fc™) e 
such that, for any q e span {^dx^ jx^, , • • • , rf^;' Ut } , 



fc,Mt = 0; 



Ph (a) - 



where, of course, {a;^, . . . , x"*} denotes the standard coordinate chart of and we tacitly use Ein- 
stein's summation convention. We recall our standing assumption that the deterministic Malliavin 
covariance matrix C (h) is invertible. 

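Lemma 16 The linear map $\rho_h:\operatorname{span}\{dx^1|_{x_T},\dots,dx^l|_{x_T}\}\to H$ given by

$$\rho_h(q)=\left(\int_0^\cdot\left\langle(q,0),\Phi^h_{T\leftarrow t}\,\sigma_1\left(\phi^h_{t\leftarrow T}(x_T)\right)\right\rangle dt,\ \dots,\ \int_0^\cdot\left\langle(q,0),\Phi^h_{T\leftarrow t}\,\sigma_m\left(\phi^h_{t\leftarrow T}(x_T)\right)\right\rangle dt\right)$$

is one-one with range $H_0^\perp$.

Proof. Since $H_0$ is the set of those $k\in H$ such that, for any $q\in\operatorname{span}\{dx^1|_{x_T},\dots,dx^l|_{x_T}\}$,

$$\int_0^T\left\langle(q,0),\Phi^h_{T\leftarrow t}\,\sigma_i\left(\phi^h_{t\leftarrow T}(x_T)\right)\right\rangle\dot k_t^i\,dt=0$$

we see that $H_0$ is the orthogonal complement in $H$ of

$$\left\{\rho_h(q):q\in\operatorname{span}\{dx^1|_{x_T},\dots,dx^l|_{x_T}\}\right\},$$

i.e. $H_0^\perp$ is the range of $\rho_h$. Invertibility of the deterministic Malliavin matrix (along $h$) then implies $\ker\rho_h=\{0\}$ which shows that $\rho_h$ is one-one (and also that $H_0^\perp$ has dimension $l$). ■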







Footnote: In fancy notation, $(q,0)=(\Pi_l)^*q$ where $(\Pi_l)^*$ is the adjoint of $(\Pi_l)_*:T_{x_T}\mathbb{R}^d\to T_{\Pi_lx_T}\mathbb{R}^l$, the differential of the projection map $\Pi_l:(x^1,\dots,x^d)\mapsto(x^1,\dots,x^l)$.
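Lemma 17 For each minimizer $h\in\mathcal{K}^{\min}$, there exists a unique $q=q(h)\in\operatorname{span}\{dx^1|_{x_T},\dots,dx^l|_{x_T}\}$ s.t.

$$h=D\phi_T(h)^*[(q,0)].$$

(Recall $D\phi_T(h):H\to T_{x_T}\mathbb{R}^d$; its adjoint then maps $T^*_{x_T}\mathbb{R}^d\to H$ where we identify $H^*$ with $H$.)

Proof. By assumption, $h$ is a minimizer, and so its differential $I'(h)$ is $0$ on $T_h\mathcal{K}_{\mathrm a}=H_0$. It follows that for every $k\in H_0$,

$$\langle dI(h),k\rangle=\int_0^T\sum_{i=1}^m\dot h_t^i\,\dot k_t^i\,dt=0$$

so that $h$ is in the orthogonal complement of $H_0$. It follows that there exists a (unique, thanks to invertibility of the deterministic Malliavin matrix along $h$)

$$q=q(h)\in\operatorname{span}\{dx^1|_{x_T},\dots,dx^l|_{x_T}\}$$

such that $h=\rho_h(q)$. It follows that

$$\dot h_t^i=\left\langle(q,0),\Phi^h_{T\leftarrow t}\,\sigma_i\left(\phi^h_{t\leftarrow T}(x_T)\right)\right\rangle.$$

It remains to see that, for any $k\in H$,

$$\langle k,h\rangle_H=\left\langle k,D\phi_T(h)^*[(q,0)]\right\rangle_H=\left\langle(q,0),D\phi_T(h)[k]\right\rangle,$$

but this follows immediately from the computation

$$\langle k,h\rangle_H=\langle k,\rho_h(q)\rangle_H=\int_0^T\left\langle(q,0),\Phi^h_{T\leftarrow t}\,\sigma_i\left(\phi^h_{t\leftarrow T}(x_T)\right)\right\rangle\dot k_t^i\,dt=\left\langle(q,0),\int_0^T\Phi^h_{T\leftarrow t}\,\sigma_i\left(\phi^h_{t\leftarrow T}(x_T)\right)\dot k_t^i\,dt\right\rangle.$$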

Lemma 18 $I''(h)$ is a bilinear form on $H_0$ given by

$$I''(h)[k,l]=\langle k,l\rangle_H-\left\langle(q(h),0),D^2\phi_T(h)[k,l]\right\rangle=\langle k,l\rangle_H-\left\langle q(h),D^2\psi_T(h)[k,l]\right\rangle$$

where $(q(h),0)\in T^*_{x_T}\mathbb{R}^d$ was constructed in lemma 17. In particular, an element $k\in H_0$ is in the null-space $\mathcal{N}(h)$ of $I''(h)$,

$$k\in\mathcal{N}(h):=\{k\in H_0:I''(h)[k,k]=0\}=\{k\in H_0:I''(h)[k,\cdot]\equiv0\text{ on }H_0\},$$

if and only if (identifying $H^*$ with $H$)

$$\langle k,\cdot\rangle_H-\left\langle p_T,D^2\phi_T(h)[k,\cdot]\right\rangle\in H_0^\perp.$$
Proof. Take a smooth curve $c:(-\varepsilon,\varepsilon)\to\mathcal{K}_{\mathrm a}$ s.t. $c(0)=h$, $\dot c(0)=k$. Then

$$I''(h)[k,k]=|k|_H^2+\langle h,\ddot c(0)\rangle_H.$$

From the previous lemma

$$I''(h)[k,k]=|k|_H^2+\left\langle(q,0),D\phi_T(h)[\ddot c(0)]\right\rangle=|k|_H^2+\left\langle q,D\psi_T(h)[\ddot c(0)]\right\rangle.$$

On the other hand, since $\psi_T(c(t))=\Pi_l\phi_T(c(t))\equiv\mathrm a$ for $t\in(-\varepsilon,\varepsilon)$ we have

$$0=\frac{d^2}{dt^2}\psi_T(c(t))\Big|_{t=0}=\frac{d}{dt}D\psi_T(c(t))[\dot c(t)]\Big|_{t=0}=D^2\psi_T(h)[k,k]+D\psi_T(h)[\ddot c(0)]$$

and hence

$$I''(h)[k,k]=|k|_H^2-\left\langle q,D^2\psi_T(h)[k,k]\right\rangle=|k|_H^2-\left\langle(q,0),D^2\phi_T(h)[k,k]\right\rangle.$$

The characterization of elements in $\mathcal{N}(h)$ is then clear. Let us just remark that $\mathcal{N}(h)$ is indeed equal to the space $\{k\in H_0:I''(h)[k,\cdot]\equiv0\text{ on }H_0\}$ as is easily seen from the fact that $I''(h)$ is positive semi-definite, since $h$ is (by assumption) a minimizer. ■

If C/ is a vector field on R'' we define the push-forward, under the diffeomorphism (^(p^^T 

by 



and hence 



U (0^^T (^)) e TM' 



We shall then need the following known formula, cf. [14, 1.21] combined with trivial time reparameterization $t\mapsto T-t$:



{z) kids. 



(25) 



Lemma 19 For k,l G H we have, with vlt = 4't (h) 

(-T i-T 



D^(j)j, (h) [fc, /] = 



Jt 



(xt) kll\dsdt 



Proof. Clearly 



D(j)j, (h) [/ 



^ ^^^t^a, (4>\^T (xt)) ^^^t-D-^t (h) [k] i\dt. 
where $\phi_T(h)=x_T$. Perturbing $h$ implies

$$\phi_T(h+\varepsilon k)=x_T+\varepsilon D\phi_T(h)[k]+o(\varepsilon)$$

and then

$$D\phi_T(h+\varepsilon k)[l]=\int_0^T\Phi^{h+\varepsilon k}_{T\leftarrow t}\,\sigma_i\left(\phi^{h+\varepsilon k}_{t\leftarrow T}\left(x_T+\varepsilon D\phi_T(h)[k]+o(\varepsilon)\right)\right)\dot l_t^i\,dt.$$

Taking derivatives then leads us to

7^20^ (h) [fc, l]=J D (<^^^j, (xt)) } [k] i\dt 

+ J $h (^',Vt (xt)) <i>?^T^'/'T (h) [A] z'idt 



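The proof is then finished using (25).

Given $k\in H_0$, set

$$\begin{pmatrix}0\\\eta\end{pmatrix}:=D\phi_T(h)[k] \qquad (26)$$

where the notation is meant to suggest that $\eta\in T_{x_T}N_{\mathrm a}$ where $N_{\mathrm a}=(\mathrm a,\cdot)\subset\mathbb{R}^l\times\mathbb{R}^{d-l}=\mathbb{R}^d$.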

Proposition 20 Elements $k\in\mathcal{N}(h)\subset H_0$ are characterized by the (inhomogeneous, linear "backward") Volterra equation



kl = ({qih),0), 



+ (^iq (h) , 0) , 'f^^tda, (<^^^T (xt)) 'I'Lt 
+ ((^?,0),<i>h^,a, (xt) 



(xt) A:^ c?s 



where 



is given by i26\) and 



ri = V (k) e span{5i+i|x5,, . . . jS^Ixt} = T^T^a- 

e = 0{k)e sp&n{dx^\^^,...,dx^\^^} =T*^N^. 

Remark 21 When $k\in\mathcal{N}(h)$ is also in $H_1=\ker D\phi_T(h)$ (which is always true in the point-point setting!) we have $\eta=0$; the equation for $k$ simplifies accordingly and matches precisely Bismut's equation [14, 1.65].



Footnote: It should be noted that the term $D\phi_T(h)[k]$ is zero for $k\in H_1=\ker D\phi_T(h)$; in particular the second summand will vanish when $D^2\phi_T(h)[\cdot,\cdot]$ is restricted to $H_1$, i.e. when considering the point-point case $l=d$.
Footnote: ... which takes the usual form upon reparameterizing time $t\mapsto T-t$ ...
Remark 22 It is an important step in our argument to single out $\eta$. In fact, we must not use

i-T 



[fl^r (xt)) kids 



as integral term for k in the above integral equation for k. Indeed, doing so would lead to a Fredholm 
integral equation (of the second kind) for k whereas it will be crucial for the subsequent argument 
to have a Volterra structure. (Solutions to such Volterra equations are unique; the same is not true 
for Fredholm integral equations.) 



Proof. For fixed k G Hq, we write 



:= Dcl^T (h) [k] . 



With slight abuse of notation (Riesz!) the previous result then implies that 



{D'cp^ (h) [k,-]}]^ 



+^\^,da, (</.1Vt (xt)) <i>t: 



(xt) kids 



On the other hand, for fc e A/" (h), we know that 

{k, - {{q (h) , 0) , D^cj^T {h) [k, •]) e = range (pj ■ 

Hence, recalling 
it follows from (I??)) that 

i-T 



kl^(iqih),0), 



(g(h),0),<i>^^,aa, (^^T (xt)) 
(0,O),$^^,a, (0^^T (xt) 



(xt) kids 



(27) 



Remark 23 If we introduce the orthogonal complement $H_2$ so that

$$H_0=H_1\oplus H_2\quad\text{(orthogonal)}$$

the map

$$k\mapsto D\phi_T(h)[k]=\begin{pmatrix}0\\\eta\end{pmatrix}$$

is a bijection from $H_2\to T_{x_T}N_{\mathrm a}$.
3.1 Jacobi variation

Again, the starting point is the formula 

/lH(pT,$^^*a, ((/.^T (xt))) 

= ((g(h),0),<i>^^,a, (^tr (xt))) 

where we recall 

PT = ((z(h),0), XT e (a, •) = A^a. 
We keep pr and fixed and note that the Hamiltonian (backward) dynamics are such that 

ni?t^T (xt, Pt) = 0tVT (xt) 

Replace pr by pt + £(^,0) above, xt by xt + e ^^"-^ write h (e) for the according controf^ 
which satisfies the relation 

h ie)l ^(pT + s {e, 0) , $^tV» [^'tl (^^ + ^ (S))) ) 
Define the Jacobi type variation 

so that 

gl = (pt, D (0^"^^ (xt)) } [g]) 

+ ^PT, <^>'i^t^<J^ (0^^T (xt)) <f l^T 

+ ((0,O),$^^,a,(0tT (xt))). 

With $p_T=(q(h),0)$ and the preceding formula we see that $g$ satisfies the identical (inhomogeneous, linear backward Volterra) equation as the one given for $k$ in proposition 20. By basic uniqueness theory for such Volterra equations we see that $\dot g=\dot k$ as elements in $L^2([0,T],\mathbb{R}^m)$, and hence $g=k$ as elements in $H$.




Proposition 24 Let k (z J\f (h) C Hq with associated parameters 

6lespan{da;^|xT.,...,rfa;'|xr} =T*^N^ 
?7espan{9;+i|xr,---,9d|xT} ^Ty.TN^ 



which can be constructed explicitly from the Hamiltonian (backward) flow 

{xt (e) , vt (£)) := Ht^T (^xy + £ °) , Pt + e(e, 0) 

and the usual formula h (e)J = (pt (e) , Vi (xt (e))) • 

Footnote: Trivial reparameterization $t\mapsto T-t$ will bring it in standard "forward" form.
provided by proposition 20. (In particular, $\eta$ is given by $D\phi_T(h)[k]$, cf. (26).) Then $k$ can be written in terms of a Jacobi type variation

$$k=\partial_{(\theta,\eta)}h.$$

Conversely, any Jacobi type variation, with $\theta\in T^*_{x_T}N_{\mathrm a}$, $\eta\in T_{x_T}N_{\mathrm a}$, yields an element in $\mathcal{N}(h)$.

Proof. The first part follows from the above discussion and it only remains to prove the converse part. Since we have seen that every Jacobi type variation $g:=\partial_{(\theta,\eta)}h$ satisfies the appropriate Volterra equation, cf. proposition 20, we only need to check





and we leave this as an easy exercise to the reader. ■ 



Recall that we say that xq is non- focal for (a, •) = Ng_ along h if for all 6 e T^^N^, rj G Txt^i 

5e|s=on//o^T (xT + £ f ° ) ,PT + £(0,O) ) =0 =^ (9,^)^0. 



In the point-point setting (i.e. $l=d$ so that $\theta\in T^*_{x_T}\mathbb{R}^d$, $\eta=0$) the criterion reduces to

$$\partial_\varepsilon\big|_{\varepsilon=0}\,\pi H_{0\leftarrow T}(x_T,p_T+\varepsilon\theta)=0\implies\theta=0;$$

disregarding time reparameterization $t\mapsto T-t$ and the fact that our setup allows for a non-zero drift vector field, this is precisely Bismut's non-conjugacy condition [14, p. 50].

Corollary 25 The point $x_0$ is non-focal for $(\mathrm a,\cdot)=N_{\mathrm a}$ along $h$ if and only if $I''(h)$, i.e. the second derivative of $\|\cdot\|_H^2$ at the minimizer $h$, viewed as quadratic form on $H_0=\ker D(\Pi_l\phi_T)(h)$, is non-degenerate, i.e.

$$\mathcal{N}(h)=\{0\}.$$

Proof. Take fc G 7V(h) ; from proposition [24l 

k = d(0^^)h = de\e=oh (e) 
for suitable 9 G T*.^N^,rj G %j,Ns,; in fact, 

= D(t>T^Q (xo) [k] . 

\ '// 

The criterion says that if 



de\e=oTlHo^T ( Xt + £ ( ^ ) , PT 



+ £(0,0)) =a,|e=o {d^otr) (^^ + ^(!!)) 
equals zero then $(\theta,\eta)$ must be zero. But this is indeed the case here since

= D {0J,Vt (xt)} [d,U=oh (e)] + (^IJ) 
= {0!,Vj. (xt)} [/c] + ci>|;^^i?4^o (xo) [k] 

{0S^T°0T4-O (xo)} W 



= 0. 



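We thus conclude that the directional derivative, which of course depends linearly on $(\theta,\eta)$, vanishes. It then follows that $k=\partial_{(\theta,\eta)}h=0$ which is what we wanted to show.

"$\Leftarrow$": Assume there exists $(\theta,\eta)\ne0$ so that

$$\partial_\varepsilon\big|_{\varepsilon=0}\,\pi H_{0\leftarrow T}\left(x_T+\varepsilon\begin{pmatrix}0\\\eta\end{pmatrix},\ p_T+\varepsilon(\theta,0)\right)=0.$$

Then $k:=\partial_{(\theta,\eta)}h$ yields an element in the null-space $\mathcal{N}(h)$. We need to see that $k$ is non-zero. Assume otherwise, i.e. $k=0$. Then $D\phi_{T\leftarrow0}(x_0)[k]=0$ and hence also $\eta=0$. From the Volterra equation for $k$ we see that

$$0=\left\langle(\theta,0),\Phi^h_{T\leftarrow t}\,\sigma_i\left(\phi^h_{t\leftarrow T}(x_T)\right)\right\rangle,\quad\text{i.e. }\rho_h(\theta)=0.$$

But $\ker\rho_h$ was seen to be trivial and so $\theta=0$; in contradiction to the assumption $(\theta,\eta)\ne0$. ■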
4 Applications 

4.1 Scaled Ornstein-Uhlenbeck 

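As a warmup, consider a scaled one-dimensional Ornstein-Uhlenbeck process of the form

$$dY_t^\varepsilon=\left(\alpha\varepsilon+\beta Y_t^\varepsilon\right)dt+\gamma\varepsilon\,dW_t,\qquad Y_0^\varepsilon=\varepsilon y_0\in\mathbb{R}.$$

Clearly, $Y_T^\varepsilon\sim N(\varepsilon\mu,\varepsilon^2\sigma^2)$ with

$$\mu=y_0e^{\beta T}+\frac{\alpha}{\beta}\left(e^{\beta T}-1\right),\qquad\sigma^2=\frac{\gamma^2}{2\beta}\left(e^{2\beta T}-1\right) \qquad (28)$$

and so has density of the form

$$f^\varepsilon(y,T)=\frac{1}{\varepsilon}\,e^{-c_1/\varepsilon^2}e^{c_2/\varepsilon}\left(c_0+O(\varepsilon)\right), \qquad (29)$$

with constants implicitly defined by (28). We leave it to the reader to check that the same density, with explicit expressions for $c_1,c_2$, is obtained from our main theorem.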
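As a quick sanity check of (28) and (29), one can compare the exact Gaussian log-density of $Y_T^\varepsilon$ with the expansion form. The sketch below is illustrative only: the function names and parameter values are assumptions, and the constants $c_1=y^2/(2\sigma^2)$, $c_2=y\mu/\sigma^2$, $c_0=e^{-\mu^2/(2\sigma^2)}/\sqrt{2\pi\sigma^2}$ are obtained here simply by expanding the square in the Gaussian exponent (in this example the $O(\varepsilon)$ term vanishes).

```python
import math

def ou_constants(alpha, beta, gamma, y0, T):
    """mu and sigma^2 as in (28); assumes beta != 0."""
    mu = y0 * math.exp(beta * T) + alpha / beta * (math.exp(beta * T) - 1.0)
    var = gamma**2 / (2.0 * beta) * (math.exp(2.0 * beta * T) - 1.0)
    return mu, var

def log_exact_density(y, eps, mu, var):
    """Log-density of N(eps*mu, eps^2*var) evaluated at y."""
    return -0.5 * math.log(2.0 * math.pi * var * eps**2) \
           - (y - eps * mu)**2 / (2.0 * var * eps**2)

def log_expansion(y, eps, mu, var):
    """log of eps^{-1} exp(-c1/eps^2) exp(c2/eps) c0 with the constants named in the lead-in."""
    c1 = y**2 / (2.0 * var)
    c2 = y * mu / var
    log_c0 = -mu**2 / (2.0 * var) - 0.5 * math.log(2.0 * math.pi * var)
    return -math.log(eps) - c1 / eps**2 + c2 / eps + log_c0

if __name__ == "__main__":
    mu, var = ou_constants(alpha=0.3, beta=-0.7, gamma=0.5, y0=0.2, T=1.0)
    for eps in (0.5, 0.1, 0.05):
        print(eps, log_exact_density(1.0, eps, mu, var) - log_expansion(1.0, eps, mu, var))
```

The printed differences should be at the level of floating-point rounding for every $\varepsilon$.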
4.2 Elliptic example with degeneracy

Consider the small noise problem for the stochastic differential equation

$$dY^\varepsilon=\varepsilon\,dW^1+\theta Z^\varepsilon\varepsilon\,dW^2,\quad Y_0^\varepsilon=0;\qquad dZ^\varepsilon=\varepsilon\,dW^2,\quad Z_0^\varepsilon=0;$$

where $\theta\in[0,1]$, say. Note that it could be immediately rephrased as a short-time problem ($T=1$, $t=\varepsilon^2$). We are in an elliptic (Riemannian) setting. (In fact, $\mathbb{R}^2$ with the induced metric has zero curvature and empty cut-locus.) Clearly, $Y_T^\varepsilon$ admits a density, say $f^\varepsilon(y)$, at time $T=1$. Considering the point $y=1$, for instance, it is not hard to see that

$$\varepsilon^2\log f^\varepsilon(1)\to-\frac12\quad\text{as }\varepsilon\downarrow0.$$

At least when $\theta=0$ it is obvious from $Y_T^\varepsilon\sim N(0,\varepsilon^2T)$ that one has the expansion

$$f^\varepsilon(1)=\varepsilon^{-1}e^{-\frac{1}{2\varepsilon^2}}\left(c_0+O(\varepsilon)\right)$$

for some (easy to compute) $c_0>0$. Interestingly, the general situation is much more involved. Exploiting the fact that $Y_T^\varepsilon$ can be written as the independent sum of a Gaussian and a (non-centered) Chi-square random variable, $f^\varepsilon(y)$ is given by a convolution integral and a direct (tedious) analysis shows that

$$f^\varepsilon(1)=\begin{cases}e^{-\frac{1}{2\varepsilon^2}}\,\varepsilon^{-1}\left(c_0+O(\varepsilon)\right)&\text{when }\theta\in[0,1)\\[4pt]e^{-\frac{1}{2\varepsilon^2}}\,\varepsilon^{-3/2}\left(c_0+O(\varepsilon)\right)&\text{when }\theta=1\end{cases} \qquad (30)$$

While the energy is equal to 1/2, no matter the value $\theta\in[0,1]$, we see the appearance of an atypical algebraic factor $\varepsilon^{-3/2}$ in the case $\theta=1$.
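The two algebraic regimes in (30) can be probed numerically. The following sketch is illustrative only and rests on stated assumptions: it takes the dynamics to be $dY^\varepsilon=\varepsilon\,dW^1+\theta Z^\varepsilon\varepsilon\,dW^2$, $dZ^\varepsilon=\varepsilon\,dW^2$ with zero initial data, so that by Itô's formula $Y_1^\varepsilon=\varepsilon W_1^1+\theta\varepsilon^2\left((W_1^2)^2-1\right)/2$. Conditioning on $W_1^2=u/\varepsilon$ gives the density at $y=1$ as a one-dimensional integral; the function names, grid and values of $\varepsilon$ are hypothetical choices. The algebraic exponent is estimated from a log-log slope of $e^{1/(2\varepsilon^2)}f^\varepsilon(1)$.

```python
import numpy as np

def rescaled_density(eps, theta, half_width=4.0, n=400001):
    """exp(1/(2 eps^2)) * f^eps(1) for Y = eps*W1 + theta*eps^2*(W2^2 - 1)/2 at T = 1."""
    u = np.linspace(-half_width, half_width, n)   # u = eps * (value of W2 at time 1)
    du = u[1] - u[0]
    # Exponent after conditioning on W2 and factoring out exp(-1/(2 eps^2)).
    g = (1.0 - 0.5 * theta * (u**2 - eps**2))**2 + u**2 - 1.0
    integrand = np.exp(-g / (2.0 * eps**2))
    return integrand.sum() * du / (2.0 * np.pi * eps**2)

def log_slope(theta, eps_lo=0.05, eps_hi=0.1):
    """Finite-difference estimate of d log(rescaled density) / d log(eps)."""
    r_lo = rescaled_density(eps_lo, theta)
    r_hi = rescaled_density(eps_hi, theta)
    return (np.log(r_hi) - np.log(r_lo)) / (np.log(eps_hi) - np.log(eps_lo))

if __name__ == "__main__":
    print("theta = 0.5:", log_slope(0.5))   # expected to be close to -1
    print("theta = 1.0:", log_slope(1.0))   # expected to be close to -3/2
```

The slope is only a finite-$\varepsilon$ estimate, so it approaches $-1$ and $-3/2$ up to corrections that shrink with $\varepsilon$.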

With a view towards applying our theorem 9 we have vector fields $\sigma_1,\sigma_2$ of the form $\partial_y$, $\theta z\partial_y+\partial_z$. One checks without difficulty that $h_0(t)=(t,0)$ is the (unique) element in $\mathcal{K}^{\min}$, for any $\theta\in[0,1]$. In particular, the "most-likely" arrival point is $(1,0)\in(1,\cdot)$. (Minimizers and energy start to look different when $\theta>1$; our focus on $\theta\in[0,1]$ is pure algebraic convenience.) In the case $\theta=1$, the explicit "backward" and projected Hamiltonian flow is

$$\pi H_{0\leftarrow T}\big((y_T,z_T),(p_T,q_T)\big)=\begin{pmatrix}y_T+\frac12\left(p_Tz_TT+q_TT-z_T\right)^2-\left(p_TT+\frac12z_T^2\right)\\[4pt]z_T-q_TT-p_Tz_TT\end{pmatrix}.$$

From this expression, it is then easy to check that $(0,0)$ is focal for $(1,\cdot)$. (Proposition 7 then implies that the Hessian of the energy at $h_0$ is degenerate. In fact, a simple computation shows that in this example the null-space of $I''(h_0)$ is given by $\mathcal{N}(h_0)=\operatorname{span}\{k\}$ where $k=k(t)=(0,t)\in T_{h_0}\mathcal{K}_1\setminus\{0\}$.) It follows that one must not apply theorem 9 here, and indeed, the prediction of the theorem (algebraic factor $\varepsilon^{-1}$) would be false in the case $\theta=1$, as we know from (30). On the other hand, one checks without trouble that for $\theta<1$ the situation is non-focal, all our assumptions are then met, and so theorem 9 yields the correct expansion, in agreement with (30).
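The focality claim for $\theta=1$ can be checked mechanically from the projected backward flow. The sketch below is a hedged illustration: it assumes $T=1$, arrival point $x_T=(1,0)$ and terminal momentum $p_T=(1,0)$ (the values corresponding to the minimizer $h_0(t)=(t,0)$, derived here and not quoted from the text), and it uses the names `vartheta`, `eta` for the variation parameters to avoid a clash with the model parameter $\theta$. It differentiates $\varepsilon\mapsto\pi H_{0\leftarrow T}\left(x_T+\varepsilon(0,\eta),p_T+\varepsilon(\vartheta,0)\right)$ at $\varepsilon=0$ by central differences.

```python
def projected_backward_flow(yT, zT, pT, qT, T=1.0):
    """pi H_{0<-T} for the theta = 1 example: returns the initial point (y_0, z_0)."""
    w = pT * zT + qT                      # z*p + q is conserved along the flow
    y0 = yT + 0.5 * (w * T - zT)**2 - (pT * T + 0.5 * zT**2)
    z0 = zT - qT * T - pT * zT * T
    return y0, z0

def directional_derivative(vartheta, eta, step=1e-6):
    """Central difference in eps of the projected flow started from the perturbed terminal data."""
    yT, zT = 1.0, 0.0                     # assumed arrival point (1, 0)
    pT, qT = 1.0, 0.0                     # assumed terminal momentum (1, 0)

    def flow(e):
        return projected_backward_flow(yT, zT + e * eta, pT + e * vartheta, qT)

    (ya, za), (yb, zb) = flow(step), flow(-step)
    return (ya - yb) / (2.0 * step), (za - zb) / (2.0 * step)

if __name__ == "__main__":
    print(directional_derivative(0.0, 1.0))   # non-zero (vartheta, eta) with vanishing derivative
    print(directional_derivative(1.0, 0.0))   # derivative does not vanish in this direction
```

A vanishing derivative for the non-zero pair $(\vartheta,\eta)=(0,1)$ is exactly what focality means here.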






4.3 Hypoelliptic Gaussian example 

We are interested in the tail behaviour of $Y_T$ where

$$dY=Z\,dt,\quad Y_0=y_0;\qquad dZ=dW_t,\quad Z_0=z_0.$$

Of course, $Y_T$ is Gaussian with mean $\mu=y_0+z_0T$ and variance

$$\sigma^2:=V[Y_T]=E\left[\int_0^T\!\!\int_0^TW_sW_t\,ds\,dt\right]=2\iint_{0<s<t<T}s\,ds\,dt=\int_0^Tt^2\,dt=T^3/3.$$


We are not looking here at the short time behaviour of $Y_t$ as $t\downarrow0$: indeed the condition (H1) is not satisfied here and indeed the log density of $Y_t$ is proportional to $1/\sigma^2=O(t^{-3})$ as $t\downarrow0$, which is not at all the behaviour described in corollary 15. Instead, let us fix $T>0$ and note that the density of $Y_T$ is of the form

$$f(y)=\frac{1}{\sqrt{2\pi\sigma^2}}\exp\left(-\frac{(y-\mu)^2}{2\sigma^2}\right)=(\text{const})\,e^{-\frac{1}{2\sigma^2}\left(y^2-2\mu y\right)}=(\text{const})\,e^{-c_1y^2+c_2y}\quad\text{as }y\uparrow\infty. \qquad (31)$$



We now illustrate how this follows from corollary 14.
Scaling: Set $Y^\varepsilon:=\varepsilon Y$ and similar for $Z$. Then

$$dY_t^\varepsilon=Z_t^\varepsilon\,dt,\qquad dZ_t^\varepsilon=\varepsilon\,dW_t.$$
In other words, the (first) component of interest to us scales with $\theta=1$. It remains to check the assumptions. With $\sigma_0=z\partial_y$ and $\sigma_1=\partial_z$ we have $[\sigma_0,\sigma_1]=\partial_y$ which not only implies the weak Hörmander condition (H) but a stronger "Bismut H2 type" condition which implies [HI Thm 1.10] invertibility of $C(h)$ for all $h\ne0$. We are interested in paths going from $\lim_{\varepsilon\to0}(Y_0^\varepsilon,Z_0^\varepsilon)=(0,0)$ to $N=(\mathrm a,\cdot)$ with $\mathrm a=1$ and it is easy to see that this is possible upon replacing $W$ by a suitable Cameron-Martin path; in other words, $\mathcal{K}_{\mathrm a}\ne\emptyset$. (Cf. [33] for an abstract criterion that applies in this example.) Since $h\equiv0$ will never steer us from $(0,0)$ to $N$ we only need to check that $(0,0)\times N$ satisfies condition (ND). To this end, we note that the Hamiltonian in the present setting is

$$\mathcal{H}\big((y,z);(p,q)\big)=pz+\frac12q^2.$$

The Hamiltonian flow $H_{t\leftarrow0}=H_{t\leftarrow0}(y,z;p,q)$ turns out to be an easily computable linear map. The details of computing the minimizing control $h=h(t)$ then follow the recipe given in remark 6. In particular, we find $\mathrm{p}_0=(p_0,q_0)=(3/T^3,3/T^2)$ and minimizing control $\dot h_0(t)=3(T-t)/T^3$ in the notation of proposition 31. In particular then,

$$c_1=\Lambda(1)=\frac12\int_0^T|\dot h_0(t)|^2\,dt=\frac{3}{2T^3}$$

in agreement with (31). For the second order constant $c_2$, we need to compute $\hat Y_T$ where

$$d\hat Y_t=\hat Z_t\,dt,\quad\hat Y_0=y_0,\qquad d\hat Z_t=0,\quad\hat Z_0=z_0.$$

This leads immediately to $\hat Y_T=y_0+z_0T=:\mu$ and then $c_2=(2\hat Y_T/\theta)\,\Lambda(1)=2\mu c_1$, again in agreement with the Gaussian computation.
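The constants of this example are easy to confirm numerically. The sketch below is an illustration under stated assumptions: it takes the control problem $\dot y=z$, $\dot z=\dot h$ with $(y_0,z_0)=(0,0)$, uses the candidate control $\dot h_0(t)=3(T-t)/T^3$, and checks by trapezoidal quadrature that the controlled path arrives at $y_T=1$ with energy $3/(2T^3)=1/(2\sigma^2)$, where $\sigma^2=T^3/3$. The function name and grid size are hypothetical.

```python
import numpy as np

def check_hypoelliptic(T=2.0, n=200001):
    """Return (y_T, energy, 3/(2 T^3)) for the control h_dot(t) = 3 (T - t) / T^3."""
    t = np.linspace(0.0, T, n)
    dt = t[1] - t[0]
    h_dot = 3.0 * (T - t) / T**3

    def trapezoid(f):
        # Composite trapezoidal rule on the uniform grid t.
        return float(np.sum(0.5 * (f[1:] + f[:-1])) * dt)

    # z solves z' = h_dot with z(0) = 0 (cumulative trapezoidal rule).
    z = np.concatenate(([0.0], np.cumsum(0.5 * (h_dot[1:] + h_dot[:-1])) * dt))
    y_T = trapezoid(z)                    # y solves y' = z with y(0) = 0
    energy = 0.5 * trapezoid(h_dot**2)    # (1/2) * integral of |h_dot|^2
    return y_T, energy, 3.0 / (2.0 * T**3)

if __name__ == "__main__":
    y_T, energy, c1 = check_hypoelliptic(T=2.0)
    sigma2 = 2.0**3 / 3.0                 # variance T^3/3 of Y_T for T = 2
    print(y_T, energy, c1, 1.0 / (2.0 * sigma2))
```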

4.4 Levy's Area 

Following a similar discussion in [49], we consider two vector fields $\sigma_1,\sigma_2$ on $\mathbb{R}^3$; the resulting diffusion

$$dX_t=\sigma_1(X_t)\,dW_t^1+\sigma_2(X_t)\,dW_t^2,\qquad X_0=x_0\in\mathbb{R}^3$$

is known as Brownian motion on the 3-dimensional Heisenberg group; it can also be viewed as the Brownian rough path (e.g. [19] and the references therein) associated to the 2-dimensional standard Brownian motion $(W^1,W^2)$. Set $X^\varepsilon:=\delta_\varepsilon X$ where $\delta_\varepsilon:(x,y,z)\mapsto(\varepsilon x,\varepsilon y,\varepsilon^2z)$ is the dilation operator on the Heisenberg group. Then

$$dX_t^\varepsilon=\sigma_1(X_t^\varepsilon)\,\varepsilon\,dW_t^1+\sigma_2(X_t^\varepsilon)\,\varepsilon\,dW_t^2$$

which is also the form relevant to study short time asymptotics. Since the Hamiltonian is explicitly available [53], our criterion (ND) can be checked directly. In particular, we so recover all "non-degenerate" projected Lévy area expansion results of [49]: in the notation of that paper, section 7, we cover their cases $(\mathrm{I})_1$, $(\mathrm{I})_2$, $(\mathrm{III})_1$, $(\mathrm{III})_2$, $(\mathrm{III})_3$. The main difference, comparing the approach of [49] with ours, is that our criterion (ND) bypasses the involved analysis, carried out by hand in [49], of the infinite-dimensional Hessian of the energy at the minimizer. On the other hand, our approach (presently) does not deal with degenerate minima, and we do not cover their cases $(\mathrm{I})_3$, $(\mathrm{II})$, $(\mathrm{III})_4$; all of which are, of course, ruled out by violating condition (ND).

Let us conclude with an application to Lévy's area not considered in [49]. We are interested in the tail-behaviour of the density, say $f=f(z)$, of Lévy's area $Z_t$ at unit time,

$$Z_1:=\frac12\int_0^1\left(W_t^1\,dW_t^2-W_t^2\,dW_t^1\right).$$

One expects $-\log f(z)\sim c^*z$ for some $c^*>0$ since $Z_1$ is an element of the second Wiener-Itô chaos which entails

$$E[\exp(cZ_1)]<\infty\quad\text{for some }0<c<c^* \qquad (32)$$
$$E[\exp(cZ_1)]=+\infty\quad\text{for some }c>c^*. \qquad (33)$$

No explicit density is available but a c.f. is known (Lévy's formula). Density expansions could then be obtained using saddle point methods, for instance. We now illustrate how such an expansion follows from corollary 14. To this end we exploit scaling, $X^\varepsilon:=\delta_\varepsilon X$. By trivial relabelling of the coordinates (1 versus 3), our tail result shows, after verification of condition (ND), that Lévy's area $Z_1$ admits a density of the form

$$f(z)=e^{-\Lambda_0(1)\,z}\,z^{-1/2}\left(c_0+O(1/\sqrt z)\right)$$

where $\Lambda_0(1)$ is seen to be the Carnot-Carathéodory distance of $(0,0,0)$ to the hyperplane $N=\{(x,y,z):z=1\}$.

Footnote: Bismut's H2 condition guarantees invertibility of the deterministic Malliavin matrix for any $h\ne0$.
Footnote: There are examples of r.v.s with (32), (33) with density $f$ for which this fails!
4.5 Black-Scholes 

The Black-Scholes (BS) model, written in log-language, is an example where the above theorem is applicable with $\theta=1$. Indeed, $Y:=\log S$ satisfies, with fixed Black-Scholes volatility $\sigma>0$,

$$dY_t=-\frac{\sigma^2}{2}\,dt+\sigma\,dW_t,\qquad Y_0=y_0=\log S_0.$$

Of course, $Y_t\sim N(y_0-\sigma^2t/2,\sigma^2t)$ and the explicit Gaussian density

$$f_{BS}(t,y)=\frac{1}{\sqrt{2\pi\sigma^2t}}\exp\left(-\frac{\left(y-(y_0-\sigma^2t/2)\right)^2}{2\sigma^2t}\right)$$

immediately yields short time resp. tail expansions,

$$f_{BS}(t,y)\sim(\text{const})\,t^{-1/2}\exp\left(-\frac{(y-y_0)^2}{2\sigma^2t}\right)\quad\text{as }t\downarrow0;\ \text{any }y\in\mathbb{R} \qquad (34)$$

$$f_{BS}(T,y)\sim(\text{const})\exp\left(-\frac{1}{2\sigma^2T}\,y^2\right)\exp\left(\frac{y_0-\sigma^2T/2}{\sigma^2T}\,y\right)\quad\text{as }y\to\infty;\ \text{any }T>0. \qquad (35)$$

We derive now both expansions from general theory, i.e. with aid of corollary 15 resp. 14. The short time limit corresponds to a flat Riemannian situation, in particular the cut-locus is empty, which is enough to guarantee (ND); the remaining computations to derive (34) from corollary 15 are left to the reader and we focus on the (more interesting) case of tail asymptotics. Corollary 14 applies with $\theta=1$, and (rescaled) starting point $\varepsilon y_0\to0$. Condition (ND) needs to be checked; the relevant Hamiltonian is

2 2 2 

^(y,P) = -YyP+^, foraU (y,^) 
and the Hamilton ODEs are 

yt = — jyt + cr pt, pt = yPt, 

with boundary conditions j/o = and yx = a- We first solve the ODE for p and we obtain 
Pt = pe" */2^ for some p G R. We can then deduce the solution for the path y as a function of p: 

l_e--^t/2 /^2p 
The boundary condition $y_T=\mathrm a$ implies that



2 

P = Po-=^ 



In particular, dpyrl — 1 — e '^^^/^ > Q, and hence invertible, for T,a > 0. En passant, we also 
deduce the optimal control hgit) = apt, and get the correct leading order factor 



2 " 2 7o ' 2a2r' 

With the hint $\hat Y_T=y_0+(-\sigma^2/2)\,T$ it is left to the reader to verify that $c_2=(y_0-\sigma^2T/2)/(\sigma^2T)$. Frequently, one chooses $y_0=0$ in this context (which amounts to normalizing the spot price to unity).
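The tail constants can be read off directly from the Gaussian density, which gives a simple consistency check. The sketch below is illustrative (function names and parameter values are assumptions): it computes $c_1=1/(2\sigma^2T)$ and $c_2=2\hat Y_Tc_1$ with $\hat Y_T=y_0-\sigma^2T/2$, as suggested by corollary 14 with $\theta=1$, and verifies that $\log f_{BS}(T,y)+c_1y^2-c_2y$ does not depend on $y$.

```python
import math

def bs_log_density(T, y, y0, sigma):
    """Log of the Gaussian density of Y_T ~ N(y0 - sigma^2 T / 2, sigma^2 T)."""
    mean = y0 - 0.5 * sigma**2 * T
    var = sigma**2 * T
    return -0.5 * math.log(2.0 * math.pi * var) - (y - mean)**2 / (2.0 * var)

def bs_tail_constants(T, y0, sigma):
    """c1 = Lambda(1) and c2 = 2 * Yhat_T * c1 (theta = 1 case of the tail corollary)."""
    c1 = 1.0 / (2.0 * sigma**2 * T)
    y_hat_T = y0 - 0.5 * sigma**2 * T
    c2 = 2.0 * y_hat_T * c1
    return c1, c2

if __name__ == "__main__":
    T, y0, sigma = 1.5, 0.1, 0.3
    c1, c2 = bs_tail_constants(T, y0, sigma)
    for y in (1.0, 5.0, 20.0):
        # The residual is the log of the constant prefactor and should not vary with y.
        print(y, bs_log_density(T, y, y0, sigma) + c1 * y**2 - c2 * y)
```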

4.6 The Stein-Stein model 

For given parameters $a>0$, $b<0$, $c>0$, $\sigma_0>0$, $\rho=d\langle W^1,W^2\rangle/dt$, the Stein-Stein model expresses log-price $Y$, under the forward measure, via

$$dY=-\frac12Z^2\,dt+Z\,dW^1,\qquad Y(0)=y_0=0 \qquad (36)$$
$$dZ=(a+bZ)\,dt+c\,dW^2,\qquad Z(0)=\sigma_0>0.$$

We will be interested in the behaviour, and in particular the tail-behaviour, of the probability density function of $Y_T$. In fact, there is no loss of generality to consider $T=1$. Applying Brownian scaling, it is a straightforward computation to see that the pair $(\tilde Y,\tilde Z)$ given by

$$\tilde Y(t):=Y(tT),\qquad\tilde Z(t):=Z(tT)\,T^{1/2}$$

satisfies the same parametric SDE form as Stein-Stein, but with the following parameter substitutions

$$a\leftarrow\tilde a=aT^{3/2},\qquad b\leftarrow\tilde b=bT,\qquad c\leftarrow\tilde c=cT,\qquad\sigma_0\leftarrow\tilde\sigma_0=\sigma_0T^{1/2}.$$

In particular then, $Y_T=Y_T(a,b,c,\sigma_0,\rho)$ has the same law as $\tilde Y_1(\tilde a,\tilde b,\tilde c,\tilde\sigma_0,\rho)$.
4.6.1 The case of zero-correlation 

For the moment, we shall follow [27] in assuming the Brownians to be uncorrelated,

$$d\langle W^1,W^2\rangle_t=\rho\,dt\quad\text{with }\rho=0.$$

Recall their main result, a density expansion for $Y_T$ of the form

$$(*):\quad f(y)=e^{-c_1y}e^{c_2\sqrt y}\,y^{-1/2}\left(c_0+O\left(y^{-1/2}\right)\right)\quad\text{as }y\to\infty. \qquad (37)$$

Scaling: Setting

$$Y^\varepsilon:=\varepsilon^2Y,\qquad Z^\varepsilon:=\varepsilon Z$$

yields the small noise problem

$$
\begin{aligned}
dY^\varepsilon&=-\frac12(Z^\varepsilon)^2\,dt+Z^\varepsilon\varepsilon\,dW^1,&&Y^\varepsilon(0)=0=:y_0\ \ \forall\varepsilon>0\\
dZ^\varepsilon&=(a\varepsilon+bZ^\varepsilon)\,dt+c\varepsilon\,dW^2,&&Z^\varepsilon(0)=\varepsilon\sigma_0=:z_0^\varepsilon\to0\ \text{as }\varepsilon\downarrow0.
\end{aligned} \qquad (38)
$$

Our corollary 14, assuming its application to be justified, then gives the correct expansion (37), namely

$$f(y)=e^{-c_1y}e^{c_2\sqrt y}\,y^{-1/2}\left(c_0+O\left(y^{-1/2}\right)\right),$$

and also identifies the constants $c_1=\Lambda(1)$, $c_2=\hat Y_T\,\Lambda'(1)$. (The leading order constant $c_1$ is in agreement with both \^Zi and 421 p40].)

Footnote: Sometimes the Stein-Stein model is written with $|Z|\,dW^1$ rather than $Z\,dW^1$. In the zero correlation case this does not make a difference to the law of the process. In fact, there is a recent tendency in the finance community to use the form $Z\,dW^1$ which we analyze here, cf. [39].



Remark 26 Corollary 14 relies on an application of theorem 9 to (38): let us note straight away that the coefficients here are smooth but unbounded. With a view towards the earlier remark on localization, and in particular (23), we note here that, due to the particular structure of the SDE, it suffices to localize such as to make $Z^\varepsilon$ bounded; e.g. by stopping it upon leaving a big ball of radius $R$. This amounts to, cf. (23), showing that

$$\lim_{R\to\infty}\limsup_{\varepsilon\to0}\varepsilon^2\log P\left[|Z^\varepsilon|_{\infty;[0,T]}\ge R\right]=-\infty.$$

But since

$$P\left[|Z^\varepsilon|_{\infty;[0,T]}\ge R\right]=P\left[|Z|_{\infty;[0,T]}\ge R/\varepsilon\right]$$

and $Z$ is a Gaussian process, this is an immediate consequence of Fernique's estimate.



We postpone the justification that we may indeed apply corollary 14 (which involves an analysis of the Hamiltonian ODEs) and proceed in showing how further qualitative information about the expansion can be obtained without much computation.

Some information on $c_1$: From theorem 9

$$c_1:=\Lambda(1)=\inf\left\{\tfrac12\|h\|_H^2:\ \phi_0^h=(0,0),\ \phi_T^h\in(1,\cdot)\right\}$$

where

$$d\phi^{h,1}=-\tfrac12\left(\phi^{h,2}\right)^2dt+\phi^{h,2}\,dh^1,\qquad d\phi^{h,2}=b\,\phi^{h,2}\,dt+c\,dh^2.$$

It then follows a priori that

$$c_1=c_1(b,c;T)\quad\text{depends on }b,c,T\text{ but not on }a,\sigma_0.$$

The same is true for $h^*:=h_0$ and $\phi^*:=\phi^{h_0}$ of course.

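Some information on $c_2$: First, $\Lambda'(1)=c_1$ also only depends on the parameters $b,c,T$ (but not on $a,\sigma_0$). It remains to analyze the factor $\hat Y_T$ where $(\hat Y_t,\hat Z_t:t\ge0)$ solves the ODE

$$d\hat Y_t=\left(\dot h_t^{*,1}-\phi_t^{*,2}\right)\hat Z_t\,dt,\quad\hat Y_0=0;\qquad d\hat Z_t=b\hat Z_t\,dt+a\,dt,\quad\hat Z_0=\sigma_0.$$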
Since $\hat Z_t=\sigma_0e^{bt}+a\int_0^te^{b(t-s)}\,ds$ it follows that $\hat Z_t$ is linear in $\sigma_0,a$ with coefficients depending on $b$ and $t$. Furthermore, noting that

$$\hat Y_T=\int_0^T\left(\dot h_t^{*,1}-\phi_t^{*,2}\right)\hat Z_t\,dt$$

a similar statement is true for $\hat Y_T$ and then $c_2=\Lambda'(1)\times\hat Y_T$. Namely, for constants $C_i=C_i(b,c;T)$,

$$c_2=C_1(b,c;T)\,\sigma_0+C_2(b,c;T)\,a.$$

It is interesting to compare this with the Heston result [24] where the constant $c_2$ also depends linearly on spot-vol $\sigma_0=\sqrt{v_0}$.

Solving the Hamiltonian ODEs and computing $c_1$: After replacing $\varepsilon\,dW$ by a control $dh$, and taking $\varepsilon\downarrow0$ elsewhere in (38), we have to consider the controlled ordinary differential equation

$$dy=-\frac12z^2\,dt+z\,dh^1,\quad y_0=0;\qquad dz=bz\,dt+c\,dh^2,\quad z_0=0, \qquad (39)$$

minimizing the energy $\frac12\int_0^T|\dot h_t|^2\,dt$ subject to $y_T=\mathrm a=1>0$.

According to general theory, we now write out the Hamiltonian associated to (39),

$$\mathcal{H}\big((y,z);(p,q)\big)=-\frac12z^2p+bzq+\frac12\left(z^2p^2+c^2q^2\right). \qquad (40)$$

The Hamiltonian ODEs then become

$$\dot y_t=z_t^2\left(p_t-\tfrac12\right),\qquad\dot z_t=bz_t+c^2q_t,\qquad\dot p_t=0,\qquad\dot q_t=p_tz_t(1-p_t)-bq_t. \qquad (41)$$

Trivially, $p_t\equiv p_0$ which we shall denote by $p$ from here on. As it turns out there is a simple expression for the energy. Although we shall ultimately take $\mathrm a=1$ it is convenient to carry out the following analysis for general $\mathrm a>0$.

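Lemma 27 For any $h_0\in\mathcal{K}^{\min}$, and in fact any $h_0$ given by the usual formula, i.e.

$$\dot h_0(t)=\begin{pmatrix}p\,z_t\\c\,q_t\end{pmatrix} \qquad (42)$$

where $(y,z;p,q)$ satisfies (41), subject to boundary conditions $(y_0,z_0)=(0,0)$ and $y_T=\mathrm a$, $q_T=0$, we have

$$\Lambda(\mathrm a)=\frac12\int_0^T|\dot h_0(t)|^2\,dt=p\,\mathrm a.$$

In particular, we see that $p>0$.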
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Remark 28 In fact, linearity in a 0/ {501) also follows immediately from the fact that the Stein- 
Stein model satisfies 9-scaling with 9 — 2 in the sense of corollary | Indeed, it was seen in the 
proof of that corollary that the rate function A (a) scales like a2/« = a. This already implies that p 
does not depend on a. This is also consistent with the principle da.A (a) = pr pointed out in remark 

m 

Proof. We give an elegant argument based on the Hamiltonian ODEs. The idea is to express $|\dot h_0(t)|^2$ as a time-derivative, which then allows for immediate integration over $t \in [0,T]$. Indeed,

\[
\begin{aligned}
|\dot h_0(t)|^2 &= p^2 z_t^2 + c^2 q_t^2 \\
&= p^2 z_t^2 + \partial_t(z_t q_t) - z_t^2\,p(1-p) \\
&= 2p\,z_t^2\,(p - \tfrac12) + \partial_t(z_t q_t) \\
&= 2p\,\dot y_t + \partial_t(z_t q_t)
\end{aligned}
\]

where we used the ODEs for $z, q$ as given in (41). It follows that

\[
\int_0^T |\dot h_0(t)|^2\,dt = 2p\,(y_T - y_0) + (z_T q_T - z_0 q_0)
\]

and we conclude with the initial/terminal/transversality conditions $y_0 = z_0 = 0$, $y_T = \mathrm a$ and $q_T = 0$. ■
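The identity behind this proof can be checked numerically. The following is a minimal sketch, not part of the original argument: it integrates the Hamiltonian ODEs for $(y, z, q)$ with a classical Runge-Kutta scheme, accumulates $\int_0^T (p^2 z_t^2 + c^2 q_t^2)\,dt$ alongside, and compares with $2p\,(y_T - y_0) + (z_T q_T - z_0 q_0)$ before any terminal condition is imposed. The function names and parameter values are illustrative assumptions.

```python
import math

def rhs(state, b, c, p):
    """Hamiltonian ODE right-hand side for (y, z, q), plus the running energy.

    p is constant along the flow; the last entry accumulates
    |h_dot|^2 = p^2 z^2 + c^2 q^2.
    """
    _, z, q, _ = state
    return (
        z * z * (p - 0.5),              # y' = z^2 (p - 1/2)
        b * z + c * c * q,              # z' = b z + c^2 q
        p * z * (1.0 - p) - b * q,      # q' = p z (1 - p) - b q
        p * p * z * z + c * c * q * q,  # integrand of the energy
    )

def shift(state, k, h):
    """Return state + h * k componentwise."""
    return tuple(s + h * ki for s, ki in zip(state, k))

def integrate(b, c, p, q0, T, steps=2000):
    """Classical RK4 from (y, z, q, energy) = (0, 0, q0, 0) up to time T."""
    state = (0.0, 0.0, q0, 0.0)
    dt = T / steps
    for _ in range(steps):
        k1 = rhs(state, b, c, p)
        k2 = rhs(shift(state, k1, dt / 2), b, c, p)
        k3 = rhs(shift(state, k2, dt / 2), b, c, p)
        k4 = rhs(shift(state, k3, dt), b, c, p)
        state = tuple(
            s + dt / 6.0 * (a1 + 2 * a2 + 2 * a3 + a4)
            for s, a1, a2, a3, a4 in zip(state, k1, k2, k3, k4)
        )
    return state

def identity_gap(b, c, p, q0, T):
    """Energy integral minus 2 p (y_T - y_0) - (z_T q_T - z_0 q_0), with y_0 = z_0 = 0."""
    y, z, q, energy = integrate(b, c, p, q0, T)
    return energy - (2.0 * p * y + z * q)

print(identity_gap(b=-0.5, c=0.8, p=1.7, q0=0.3, T=1.0))
```

The gap should vanish up to discretization error for any $(p, q_0)$; the statement of the lemma then follows once $y_T = \mathrm a$ and $q_T = 0$ are imposed.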

Lemma 29 (Partial Hamiltonian Flow) Consider (41) as initial value problem, with initial data $(y_0, z_0) = (0,0)$ and $(p, q_0)$. Assume*

\[
\chi_p^2 := c^2 p(p-1) - b^2 \ge 0. \tag{43}
\]

Then the explicit solution is given by

\[
\begin{aligned}
y_t &= \frac{q_0^2 c^4 (2p-1)}{8\chi_p^3}\,\big(2\chi_p t - \sin(2\chi_p t)\big),\\
z_t &= \frac{q_0 c^2}{\chi_p}\,\sin(\chi_p t),\\
p_t &= p,\\
q_t &= q_0\Big(\cos(\chi_p t) - \frac{b}{\chi_p}\sin(\chi_p t)\Big).
\end{aligned}
\tag{44}
\]

Remark 30 The given solutions remain valid when $\chi_p^2 < 0$; it suffices to consider $\chi_p$ as purely imaginary; then, if desired, rewrite $\cos(\chi_p t) = \cosh(|\chi_p| t)$ etc. Below, we shall solve (41) as boundary value problem, subject to $(y_0, z_0) = (0,0)$, $y_T = \mathrm a > 0$ and $q_T = 0$; we shall see then that (43) is always satisfied and in fact $\chi_p^2 > 0$.

* All explicit solutions given in (44) are even functions of $\chi_p$ and have a removable singularity for $\chi_p = 0$. By convention we shall always assume $\chi_p \ge 0$, although the sign of $\chi_p$ does not matter.

Proof. Let us first remark that the path $(p_t)_{t \ge 0}$ is constant, $p_t \equiv p$ for all $t \in [0,T]$. From the Hamiltonian ODEs, the couple $(z_t, q_t)_{t \ge 0}$ solves a linear ODE in $\mathbb{R}^2$, so that the solution must be a linear function of $(z_0, q_0) = (0, q_0)$. Indeed, a simple computation gives

\[
q_t = q_0\Big(\cos(\chi_p t) - \frac{b}{\chi_p}\sin(\chi_p t)\Big) \quad\text{and}\quad z_t = \frac{q_0 c^2}{\chi_p}\sin(\chi_p t).
\]

Elementary integration ("$2\int_0^t \sin^2 = t - \cos t \sin t$") then gives $(y_t)_{t \ge 0}$ by direct integration; indeed

\[
y_t = (p - \tfrac12)\int_0^t z_s^2\,ds = \frac{q_0^2 c^4 (2p-1)}{8\chi_p^3}\,\big(2\chi_p t - \sin(2\chi_p t)\big).
\]

This proves the lemma. ■
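As a sanity check on the closed-form flow, the sketch below evaluates the formulas for $(y_t, z_t, q_t)$ and compares central-difference derivatives with the right-hand sides $\dot y = z^2(p - \tfrac12)$, $\dot z = bz + c^2 q$, $\dot q = pz(1-p) - bq$. It assumes $\chi_p^2 = c^2 p(p-1) - b^2 > 0$; the function names and the sample parameters are illustrative only.

```python
import math

def flow(t, b, c, p, q0):
    """Closed-form (y_t, z_t, q_t) started from (y_0, z_0) = (0, 0) and q(0) = q0.

    Assumes chi^2 = c^2 p (p - 1) - b^2 > 0 (oscillatory regime).
    """
    chi = math.sqrt(c * c * p * (p - 1.0) - b * b)
    y = q0 ** 2 * c ** 4 * (2 * p - 1) / (8 * chi ** 3) * (
        2 * chi * t - math.sin(2 * chi * t)
    )
    z = q0 * c * c / chi * math.sin(chi * t)
    q = q0 * (math.cos(chi * t) - b / chi * math.sin(chi * t))
    return y, z, q

def ode_residual(t, b, c, p, q0, eps=1e-5):
    """Largest mismatch between finite-difference derivatives and the ODE right-hand sides."""
    y_hi, z_hi, q_hi = flow(t + eps, b, c, p, q0)
    y_lo, z_lo, q_lo = flow(t - eps, b, c, p, q0)
    _, z, q = flow(t, b, c, p, q0)
    dy = (y_hi - y_lo) / (2 * eps)
    dz = (z_hi - z_lo) / (2 * eps)
    dq = (q_hi - q_lo) / (2 * eps)
    return max(
        abs(dy - z * z * (p - 0.5)),
        abs(dz - (b * z + c * c * q)),
        abs(dq - (p * z * (1.0 - p) - b * q)),
    )

print(ode_residual(t=0.7, b=-0.5, c=0.8, p=1.7, q0=0.3))
```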

For the next proposition we recall the standing assumptions $T > 0$, $b \le 0$ (which models mean-reversion) and $\mathrm a > 0$.

Proposition 31 The ensemble of solutions to the Hamilton ODEs (41) as boundary value problem

\[
(y_0, z_0) = (0,0) \quad\text{and}\quad y_T = \mathrm a,\ q_T = 0
\]

with $\mathrm a = 1 > 0$ are characterized by inserting, for any $k \in \{1,2,\dots\}$ and any choice of sign in (46) below,

\[
p_k^+ = \frac12\left(1 + \sqrt{1 + \frac{4b^2}{c^2} + \frac{4 r_k^2}{c^2 T^2}}\,\right) \tag{45}
\]

\[
q_{0,k}^\pm = \pm\frac{2}{c^2}\sqrt{\frac{2 r_k^3\,\mathrm a}{(2p_k^+ - 1)\,T^3\,\big(2r_k - \sin(2r_k)\big)}} \tag{46}
\]

in (44). Here $\{r_k : k = 1,2,\dots\}$ denotes the set of (increasing) strictly positive roots to

\[
r\cos(r) - bT\sin(r) = 0.
\]

Remark 32 As the proof will show, $p_k^+$ as given in (45) is the unique positive root to

\[
c^2 p(p-1) - b^2 = \Big(\frac{r_k}{T}\Big)^2;
\]

in particular, assumption (43) in the previous lemma is met.
Proof. By assumption and (44),

\[
0 = q_T = q_0\Big(\cos(\chi_p T) - \frac{b}{\chi_p}\sin(\chi_p T)\Big). \tag{47}
\]

At this stage, $\chi_p$ could be a complex number (when $\chi_p^2 < 0$). Let us note straight away that we must have $q_0 \ne 0$, for otherwise $(y_t)_{t \ge 0}$, which depends linearly on $q_0^2$ as is seen explicitly in (44), would be identically equal to zero, in contradiction with $y_T = \mathrm a > 0$. Let us also note that $\chi_p \ne 0$, for otherwise (47), which has a removable singularity at $\chi_p = 0$, leads to the contradiction $0 = 1 - bT$. (Recall $b \le 0$, $T > 0$.) But then $r := \chi_p T$ is a root, i.e. maps to zero, under the map

\[
r \in \mathbb{C} \mapsto r\cos r - bT\sin r = r\Big(\cos r - bT\,\frac{\sin r}{r}\Big). \tag{48}
\]

A complex analysis lemma [27, Lemma 4] asserts that this map, provided

\[
-bT \ge 0, \tag{49}
\]

has only real roots; it follows that $\chi_p$ is real and so $\chi_p^2 \ge 0$; actually $\chi_p^2 > 0$, since we already noted that $\chi_p \ne 0$. Note that (47), and in fact all further expressions involving $\chi_p$, are unchanged upon changing the sign of $\chi_p$; we shall agree to take $\chi_p > 0$ as the positive square-root of $\chi_p^2$. In particular, (47) is equivalent to the existence of $\chi_p > 0$ such that

\[
\chi_p T\cos(\chi_p T) - bT\sin(\chi_p T) = 0.
\]

It follows that $\chi_p T \in \{r_k : k = 1, 2, \dots\}$, the set of strictly positive zeros of (48) written in increasing order. We deduce that, for each $k = 1, 2, \dots$, there is a choice of $p$ arising from

\[
\chi_p^2 = c^2 p(p-1) - b^2 = \Big(\frac{r_k}{T}\Big)^2 .
\]

For each $k$, there is a negative solution, say $p = p_k^- < 0$, which we may ignore thanks to lemma 27, and a positive solution, namely

\[
p_k^+ = \frac12\left(1 + \sqrt{1 + \frac{4b^2}{c^2} + \frac{4 r_k^2}{c^2 T^2}}\,\right).
\]

We now exploit $y_T = \mathrm a$. From the explicit expression of $y_t$ given in (44) we get

\[
\mathrm a = y_T = \frac{q_0^2 c^4 (2p-1)}{8\chi_p^3}\,\big(2\chi_p T - \sin(2\chi_p T)\big)
\]

and thus

\[
q_0^2 = \frac{8\chi_p^3\,\mathrm a}{c^4 (2p-1)\big(2\chi_p T - \sin(2\chi_p T)\big)}
= \frac{8 r_k^3\,\mathrm a}{c^4 (2p-1)\,T^3\,\big(2r_k - \sin(2r_k)\big)} .
\]

It follows that, for each $k \in \{1,2,\dots\}$, we can take

\[
p_k^+ = \frac12\left(1 + \sqrt{1 + \frac{4b^2}{c^2} + \frac{4}{c^2}\Big(\frac{r_k}{T}\Big)^2}\,\right), \qquad
q_{0,k}^\pm = \pm\frac{2}{c^2}\sqrt{\frac{2 r_k^3\,\mathrm a}{(2p_k^+ - 1)\,T^3\,\big(2r_k - \sin(2r_k)\big)}}
\]
and any such choice in (44) leads to a solution of the boundary value problem.

So far, we have for each $k \in \{1,2,\dots\}$ two choices of $(p, q_0)$, depending on the sign in (46), so that the resulting Hamiltonian ODE solutions, started from $(y_0, z_0) = (0,0)$ and $(p, q_0)$, describe all possible solutions of the boundary value problem given by the Hamiltonian ODEs with mixed initial/terminal data

\[
(y_0, z_0) = (0,0) \quad\text{and}\quad y_T = \mathrm a,\ q_T = 0.
\]

It remains to see which choice (or choices) lead to minimizing controls, i.e. $h_0 \in \mathcal{K}_{\mathrm a}^{\min}$. But this is easy since we know from lemma 27 that, for any $p \in \{p_k^+ : k = 1,2,\dots\}$,

\[
\frac12\int_0^T |\dot h_0(t)|^2\,dt = p\,\mathrm a.
\]

Since $p_k^+$ is plainly (strictly) increasing in $k \in \{1,2,\dots\}$, we see that the energy is minimal if and only if $p = p_1^+$. On the other hand, we are left with two choices for $q_0$, namely $q_{0,1}^+$ and $q_{0,1}^-$. Using (42) we then see that there are two minimizing controls,

\[
\mathcal{K}_{\mathrm a}^{\min} = \{h_0^+, h_0^-\},
\]

given by

\[
\dot h_0^\pm(t) = \begin{pmatrix} p\,\dfrac{q_0 c^2}{\chi_p}\sin(\chi_p t) \\[2mm] c\,q_0\Big(\cos(\chi_p t) - \dfrac{b}{\chi_p}\sin(\chi_p t)\Big) \end{pmatrix}
\quad\text{with } (p, q_0) \leftarrow (p_1^+, q_{0,1}^+) \text{ resp. } (p_1^+, q_{0,1}^-).
\]

Of course, $h_0^\pm$ stands for $h_0^+$ resp. $h_0^-$ depending on the chosen substitution above. In $(y,z)$-coordinates, note that the paths driven by $h_0^+$ and $h_0^-$ have identical $y$-components; their $z$-components only differ by a flipped sign due to $q_{0,1}^- = -q_{0,1}^+$. (This reflects a fundamental symmetry in our problem, which is in fact invariant under $(y,z) \mapsto (y,-z)$.) We summarize our findings in stating that

\[
\Lambda(\mathrm a) = \tfrac12\|h_0^+\|_H^2 = \tfrac12\|h_0^-\|_H^2 = p_1^+\,\mathrm a \tag{50}
\]

and upon taking $\mathrm a = 1$ we have computed the leading order constant

\[
c_1 = \Lambda(1) = p_1^+ = \frac12\left(1 + \sqrt{1 + \frac{4b^2}{c^2} + \frac{4 r_1^2}{c^2 T^2}}\,\right)
\]

where we recall that $r_1$ is the first strictly positive root of the equation $r\cos(r) - bT\sin(r) = 0$.
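For concreteness, the leading order constant can be evaluated numerically. The following sketch assumes $b \le 0$, so that the first strictly positive root of $r\cos r - bT\sin r$ lies in $[\pi/2, \pi)$ and can be bracketed there, and then evaluates $c_1 = \tfrac12\big(1 + \sqrt{1 + 4b^2/c^2 + 4r_1^2/(c^2T^2)}\big)$. The function names are illustrative, not taken from the text.

```python
import math

def first_root(b, T):
    """First strictly positive root of r cos r - b T sin r = 0, assuming b <= 0.

    For b <= 0 the function is >= 0 at pi/2 and equals -pi at pi, and
    r cot r is strictly decreasing on (0, pi), so bisection on [pi/2, pi] applies.
    """
    def f(r):
        return r * math.cos(r) - b * T * math.sin(r)

    lo, hi = math.pi / 2, math.pi
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if f(mid) >= 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def leading_constant(b, c, T):
    """c_1 = p_1^+ evaluated at the first root r_1."""
    r1 = first_root(b, T)
    return 0.5 * (1.0 + math.sqrt(1.0 + 4 * b * b / (c * c) + 4 * r1 * r1 / (c * c * T * T)))

print(leading_constant(b=-1.0, c=0.5, T=1.0))
```

In the driftless case $b = 0$ the root is $r_1 = \pi/2$, which gives $c_1 = \tfrac12\big(1 + \sqrt{1 + \pi^2/(c^2T^2)}\big)$.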

Computing $c_2$. According to general theory, cf. equation (18), we need to compute certain ODEs for each minimizer, $h_0^+ = (h_0^{+,1}, h_0^{+,2})$ resp. $h_0^- = (h_0^{-,1}, h_0^{-,2})$, exhibited in the previous section. For ease of notation we shall write $(p, q_0^\pm)$ instead of $(p_1^+, q_{0,1}^+)$ resp. $(p_1^+, q_{0,1}^-)$ in this section. Related to equation (38) we then have to consider the following ODE along $h_0^+$ (and then along $h_0^-$)

\[
d\begin{pmatrix} \hat Y_t \\ \hat Z_t \end{pmatrix}
= \begin{pmatrix} 0 & (p-1)\,z_t^+ \\ 0 & b \end{pmatrix}\begin{pmatrix} \hat Y_t \\ \hat Z_t \end{pmatrix} dt
+ \begin{pmatrix} 0 \\ a \end{pmatrix} dt
\quad\text{with}\quad
\begin{pmatrix} \hat Y_0 \\ \hat Z_0 \end{pmatrix} = \begin{pmatrix} 0 \\ \sigma_0 \end{pmatrix}.
\]

Here, we used the fact that $\dot h_0^{\pm,1} = p z^\pm$, where $\pm$ indicates the chosen sign of $q_{0,1}$ upon which it depends, cf. (46). The ODE along $h_0^-$ for $\hat Y = \hat Y^-$ is similar, with $z^+, \dot h_0^{+,1}$ replaced by $z^- = -z^+$, $\dot h_0^{-,1} = -\dot h_0^{+,1}$ respectively. We can solve these ODEs explicitly. In a first step (regardless of the chosen sign for $z, \dot h_0$)

\[
\hat Z_t = \begin{cases} \sigma_0 e^{bt} + \dfrac{a}{b}\big(e^{bt} - 1\big) & \text{for } b < 0,\\[2mm] \sigma_0 + at & \text{for } b = 0, \end{cases}
\]

and since

\[
\hat Y_T^\pm = (p-1)\int_0^T z_t^\pm \hat Z_t\,dt
\]

we see that $\hat Y_T^+ = -\hat Y_T^-$. In fact, under the (usual) model parameter assumptions $a \ge 0$, $\sigma_0 > 0$ we see that $\hat Z_t > 0$. We then note that

\[
z_t^+/q_0^+ = \frac{c^2}{\chi_p}\sin(\chi_p t) \ge 0 \quad\text{for } t \in [0,T];
\]

indeed we saw that $\chi_p T \in [\pi/2, \pi)$, which implies $\chi_p t \in [0, \pi)$ and hence $\sin(\chi_p t) \ge 0$. In particular, given that $q_0^+ > 0$ and $p > 1$ we see that $\hat Y_T^+ > 0$ (and then $\hat Y_T^- < 0$). It follows that

\[
c_2 := c_2^+ = \Lambda'(1) \times \hat Y_T^+ = p(p-1)\int_0^T z_t^+ \hat Z_t\,dt \tag{51}
\]

whereas the contribution from $c_2^- = \Lambda'(1) \times \hat Y_T^-$ is exponentially smaller and will not figure in the expansion (cf. the corresponding remark). In fact, given the explicit form of $t \mapsto z_t^+$ resp. $\hat Z_t$ in terms of $\sin(\cdot)$ and $\exp(\cdot)$, it is clear that the integration in (51) can be carried out in closed form. In doing so, one exploits a cancellation due to

\[
-\chi_p\cos(\chi_p T) + b\sin(\chi_p T) = 0
\]

and also the equality $\chi_p^2 + b^2 = c^2 p(p-1)$; one is led to

\[
c_2 = q_0^+\left\{\sigma_0 + a\,\frac{\tan(\chi_p T/2)}{\chi_p}\right\}.
\]

It is possible, of course, to substitute the explicitly known quantities $q_0^+, \chi_p$, but this does not yield additional insight.
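The closed form for $c_2$ can be compared against direct quadrature of $p(p-1)\int_0^T z_t^+ \hat Z_t\,dt$. The sketch below is a numerical check under stated assumptions: target level $\mathrm a = 1$, $b \le 0$, $z_t^+ = (q_0^+ c^2/\chi_p)\sin(\chi_p t)$ with $\chi_p = r_1/T$, and $\hat Z_t = \sigma_0 e^{bt} + (a/b)(e^{bt} - 1)$ (or $\sigma_0 + at$ when $b = 0$). All function names are illustrative.

```python
import math

def first_root(b, T):
    """First strictly positive root of r cos r - b T sin r = 0 (b <= 0), by bisection."""
    lo, hi = math.pi / 2, math.pi
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mid * math.cos(mid) - b * T * math.sin(mid) >= 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def z_hat(t, a, b, sigma0):
    """Solution of dZ = (b Z + a) dt with Z(0) = sigma0."""
    if b == 0.0:
        return sigma0 + a * t
    return sigma0 * math.exp(b * t) + a / b * (math.exp(b * t) - 1.0)

def c2_closed_and_quadrature(a, b, c, sigma0, T, n=2000):
    """Return (closed form, Simpson quadrature) for the second-order constant c_2."""
    r = first_root(b, T)
    chi = r / T
    p = 0.5 * (1.0 + math.sqrt(1.0 + 4 * b * b / (c * c) + 4 * chi * chi / (c * c)))
    # q_0^+ for target level y_T = 1
    q0 = 2.0 / (c * c) * math.sqrt(
        2.0 * r ** 3 / ((2 * p - 1) * T ** 3 * (2 * r - math.sin(2 * r)))
    )

    def integrand(t):
        return q0 * c * c / chi * math.sin(chi * t) * z_hat(t, a, b, sigma0)

    # composite Simpson rule (n must be even)
    h = T / n
    total = integrand(0.0) + integrand(T)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(i * h)
    quadrature = p * (p - 1.0) * total * h / 3.0

    closed = q0 * (sigma0 + a * math.tan(chi * T / 2.0) / chi)
    return closed, quadrature

print(c2_closed_and_quadrature(a=0.2, b=-0.5, c=0.6, sigma0=0.3, T=1.0))
```

Both numbers should agree up to quadrature error, which also illustrates the linear dependence of $c_2$ on $\sigma_0$ and $a$.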

4.6.2 The case of non-zero correlation 

We consider again the SDE (36) with diffusion matrix

\[
\sigma = (\sigma_1, \sigma_2) = \begin{pmatrix} z & 0 \\ 0 & c \end{pmatrix}
\]

but now allow for correlation $\rho$ between $W^1, W^2$; we thus have the non-trivial correlation matrix

\[
\Omega = \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix}, \qquad
\Omega^{-1} = \frac{1}{1-\rho^2}\begin{pmatrix} 1 & -\rho \\ -\rho & 1 \end{pmatrix}.
\]

In view of financial applications [5D] it makes sense to focus on the case $\rho \in (-1,0]$. This will also prove convenient in our analysis below, although there is no doubt that the case $\rho > 0$, less interesting in practice, could also be handled within the present framework.
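The Hamiltonian becomes, cf. (40),

\[
H = -\tfrac12 z^2 p + bzq + \tfrac12\big(z^2 p^2 + c^2 q^2\big) + \rho c z p q
= -\tfrac12 z^2 p + \tilde b z q + \tfrac12\big(z^2 p^2 + c^2 q^2\big)
\]

with

\[
\tilde b := \tilde b_p := b + \rho c p .
\]

Note that $\partial_{(y,z)}\tilde b = (0,0)'$ and $\partial_{(p,q)}\tilde b = (\rho c, 0)'$. The Hamiltonian equations for $z, p, q$ are thus identical to those in the uncorrelated case; one just has to replace $b$ by $\tilde b$. (In particular, $p_t$ is again seen to be constant and we denote its value by $p$.) The Hamiltonian equation for $\dot y = \partial_p H$ has, in comparison to the uncorrelated case, an additional term, namely $(\partial_p \tilde b)\,z_t q_t = \rho c\,z_t q_t$. In summary, the Hamiltonian ODEs are

\[
\begin{pmatrix} \dot y_t \\ \dot z_t \end{pmatrix}
= \begin{pmatrix} z_t^2\,(p_t - \tfrac12) + \rho c\,z_t q_t \\ \tilde b z_t + c^2 q_t \end{pmatrix},
\qquad
\begin{pmatrix} \dot p_t \\ \dot q_t \end{pmatrix}
= \begin{pmatrix} 0 \\ p_t z_t (1 - p_t) - \tilde b q_t \end{pmatrix}.
\]

The following lemma is then obvious (only $y$ requires a computation, due to the additional term in the Hamiltonian ODEs).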

Lemma 33 (Partial Hamiltonian Flow, correlated case) Consider the above Hamiltonian ODEs as initial value problem, with initial data $(y_0, z_0) = (0,0)$ and $(p, q_0)$, and assume

\[
\chi_p^2 := c^2 p(p-1) - \tilde b_p^2 \ge 0. \tag{52}
\]

Then the explicit solutions for $z, p, q$ are identical to the uncorrelated case; one just has to replace $b$ by $\tilde b_p$ throughout. The explicit solution for $y$ is modified to

\[
y_t = \frac{q_0^2 c^2}{8\chi_p^3}\Big[\big(c^2(2p-1) - 2\rho c\,\tilde b_p\big)\big(2\chi_p t - \sin(2\chi_p t)\big) + 2\rho c\,\chi_p\big(1 - \cos(2\chi_p t)\big)\Big]. \tag{53}
\]

In our explicit analysis of the uncorrelated case (more precisely, in solving the coupled ODEs $\dot z_t = b z_t + c^2 q_t$, $\dot q_t = p_t z_t(1 - p_t) - b q_t$) we made use of the (model) assumption $b \le 0$, cf. (49). Conveniently, this remains true when $\rho \in (-1,0]$. Indeed, the following lemma shows we must have $p \ge 0$, so that (with $\rho \le 0$, $c > 0$)

\[
\tilde b = b + \rho c p \le 0. \tag{54}
\]

Lemma 34 Let $\mathrm a > 0$. Then $\Lambda(\mathrm a) = p\,\mathrm a$ and therefore $p \ge 0$.

Proof. We saw in the proof of lemma 27 that, in the uncorrelated case, as a direct consequence of the Hamiltonian ODEs,

\[
p^2 z_t^2 + c^2 q_t^2 = 2p\,\dot y_t + \partial_t(z_t q_t).
\]

The correlated case has the identical Hamiltonian ODEs provided we substitute

\[
b \leftarrow \tilde b \quad\text{and}\quad \dot y \leftarrow \dot y - \rho c\,z_t q_t .
\]

We therefore have

\[
\begin{aligned}
|\dot h_0(t)|^2 &= (p \;\; q_t)\begin{pmatrix} z_t^2 & \rho c z_t \\ \rho c z_t & c^2 \end{pmatrix}\begin{pmatrix} p \\ q_t \end{pmatrix}
= p^2 z_t^2 + c^2 q_t^2 + 2\rho c\,p\,z_t q_t \\
&= 2p\,(\dot y_t - \rho c\,z_t q_t) + \partial_t(z_t q_t) + 2\rho c\,p\,z_t q_t = 2p\,\dot y_t + \partial_t(z_t q_t)
\end{aligned}
\]

and then conclude with the boundary data, exactly as in lemma 27. ■

As already noted, $\tilde b \le 0$ allows us to recycle all closed form expressions for $z, q$ obtained in the uncorrelated case; it suffices to replace $b$ by $\tilde b$. In particular, for some yet unknown $p, q_0$ which may and will depend on $\rho$,

\[
z_t = \frac{q_0 c^2}{\chi_p}\sin(\chi_p t), \qquad
q_t = q_0\Big(\cos(\chi_p t) - \frac{\tilde b}{\chi_p}\sin(\chi_p t)\Big)
\]

where $\chi_p^2 := c^2 p(p-1) - \tilde b^2$ is seen to be positive as in the "uncorrelated" argument. Also, $q_0 \ne 0$, seen as in the "uncorrelated" case. Transversality, $q_T = 0$, then implies

\[
\chi_p\cos(\chi_p T) - \tilde b\sin(\chi_p T) = 0. \tag{55}
\]

Introducing $r := \chi_p T$ then gives the equation

\[
r\cot r = (b + \rho c p)\,T. \tag{56}
\]

On the other hand, from the very definition of $\chi_p$, we know

\[
(r/T)^2 = c^2 p(p-1) - (b + \rho c p)^2. \tag{57}
\]

In the uncorrelated case, these two equations were effectively decoupled; in particular, $r\cot r = bT$ led to $r \in \{r_k : k = 1, 2, \dots\} \subset (0,\infty)$, written in increasing order. Since $p^+$ was seen to be monotonically increasing in $r$, cf. equation (45), and we were looking for the minimal $p$, corresponding to the minimal energy (cf. lemma 34), we were led to seek the first positive root $r_1$. (In fact, $r_1 \in [\pi/2, \pi)$, as we will also find in the "correlated" discussion below.)

The correlated case is a little more complicated and we start by expressing $p$ in equation (56) in terms of $r$. Indeed, the quadratic equation (57) shows

\[
p^\pm(r) = \frac{1}{2(1-\rho^2)}\left(1 + 2\rho\frac{b}{c} \pm \sqrt{\Big(1 + 2\rho\frac{b}{c}\Big)^2 + 4(1-\rho^2)\Big(\frac{b^2}{c^2} + \frac{r^2}{c^2 T^2}\Big)}\,\right) \tag{58}
\]

where $p^-(r) < 0$ (and hence can be ignored in view of lemma 34) and $p^+(r) > 0$. We now look for $r$ which satisfies the equation

\[
r\cot r = \big(b + \rho c\,p^+(r)\big)\,T.
\]

It is elementary to see that $r\cot r$ is non-negative on $[0, \pi/2]$ and then maps $[\pi/2, \pi)$ strictly monotonically to $(-\infty, 0]$. On the other hand, the map $r \mapsto (b + \rho c\,p^+(r))\,T$ is $\le 0$ for all $r$; in particular, there will be a first intersection with the graph of $r \mapsto r\cot r$ in $[\pi/2, \pi)$, say at $r = r_1^+$. Since $p^+(r)$ is plainly strictly increasing in $r$, the minimal $p$ must equal

\[
p_1^+ := p^+(r_1^+).
\]
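The coupled root problem can be explored numerically. The sketch below is only an illustration under assumptions: it takes $p^+(r)$ to be the larger root of $(r/T)^2 = c^2 p(p-1) - (b + \rho c p)^2$, and it uses bisection on $[\pi/2, \pi]$ for $g(r) = r\cos r - (b + \rho c\,p^+(r))\,T\sin r$. Bisection returns some root in that interval; that it is the first intersection is assumed here (it is when the intersection is unique). Function names are hypothetical.

```python
import math

def p_plus(r, b, c, rho, T):
    """Larger root p of (r/T)^2 = c^2 p (p - 1) - (b + rho c p)^2, for |rho| < 1."""
    lin = 1.0 + 2.0 * rho * b / c
    disc = lin * lin + 4.0 * (1.0 - rho * rho) * (b * b + (r / T) ** 2) / (c * c)
    return (lin + math.sqrt(disc)) / (2.0 * (1.0 - rho * rho))

def first_intersection(b, c, rho, T):
    """Root of r cot r = (b + rho c p_plus(r)) T in [pi/2, pi), found by bisection.

    Assumes b <= 0, c > 0, -1 < rho <= 0, so that g(pi/2) >= 0 > g(pi).
    Uniqueness of the root in this interval is an assumption of this sketch.
    """
    def g(r):
        b_tilde = b + rho * c * p_plus(r, b, c, rho, T)
        return r * math.cos(r) - b_tilde * T * math.sin(r)

    lo, hi = math.pi / 2, math.pi
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) >= 0.0:
            lo = mid
        else:
            hi = mid
    r = 0.5 * (lo + hi)
    return r, p_plus(r, b, c, rho, T)

r1, p1 = first_intersection(b=-0.5, c=0.6, rho=-0.7, T=1.0)
print(r1, p1)  # p1 plays the role of the leading order constant c_1 when a = 1
```

Setting `rho=0.0` reduces `p_plus` to the uncorrelated formula, which gives a quick consistency check between the two cases.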

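We then proceed as in the uncorrelated case, and determine $q_0$ from the boundary condition $y_T = \mathrm a > 0$, where $y$ is now given by (53). This leads to $q_0 \in \{q_{0,1}^+, q_{0,1}^-\}$ where

\[
q_{0,1}^\pm = \pm\frac{2}{c}\sqrt{\frac{2r^3\,\mathrm a}{T^3\Big(\big(c^2(2p-1) - 2\rho c\,\tilde b\big)\big(2r - \sin(2r)\big) + 2\rho c\,(r/T)\big(1 - \cos(2r)\big)\Big)}}
\]

where $r = r_1^+$ and $p = p_1^+$. Again, we have two minimizing controls, $\mathcal{K}_{\mathrm a}^{\min} = \{h_0^+, h_0^-\}$. We now have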

\[
\dot h_0(t) = \begin{pmatrix} p z_t + \rho c\,q_t \\ \rho p z_t + c\,q_t \end{pmatrix} \tag{59}
\]

instead of (42), and of course lemma 33 implies that $z_t$ and $q_t$ are fully and explicitly determined for each choice of $(p, q_0)$. In particular, for $(p, q_0) \leftarrow (p_1^+, q_{0,1}^+)$ resp. $(p_1^+, q_{0,1}^-)$ we so obtain $h_0^+$ resp. $h_0^-$, which can be written explicitly by simple substitution. Moreover, and again as in the uncorrelated case,

\[
\Lambda(\mathrm a) = \tfrac12\|h_0^+\|_H^2 = \tfrac12\|h_0^-\|_H^2 = p_1^+\,\mathrm a \tag{60}
\]

and upon taking $\mathrm a = 1$ we have computed the leading order constant

\[
c_1 = \Lambda(1) = p_1^+ = p^+(r_1^+)
\]

where we recall that $r_1^+$ is the first intersection point of $r \mapsto r\cot r$ with $r \mapsto (b + \rho c\,p^+(r))\,T$, and $p^+(\cdot)$ was given in (58).

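At last, we turn to the computation of the second-order exponential constant, $c_2$. As in the uncorrelated case, we ease notation by writing $(p, q_0^\pm)$ instead of $(p_1^+, q_{0,1}^+)$ resp. $(p_1^+, q_{0,1}^-)$ for the rest of this section. Again, we have to consider ODEs for $(\hat Y_t, \hat Z_t)$, for each minimizer, $h_0^+ = (h_0^{+,1}, h_0^{+,2})$ and $h_0^- = (h_0^{-,1}, h_0^{-,2})$. Recall from (59) that, with $p = p_1^+$,

\[
\dot h_0^\pm(t) = \begin{pmatrix} p z_t^\pm + \rho c\,q_t^\pm \\ \rho p z_t^\pm + c\,q_t^\pm \end{pmatrix}
\]

where $(\cdot)^\pm$ indicates the chosen sign of $q_0 \in \{q_{0,1}^+, q_{0,1}^-\}$, which determines the choice of minimizer. We first determine $\hat Y_t = \hat Y_t(h_0^+)$ from the ODE

\[
d\begin{pmatrix} \hat Y_t \\ \hat Z_t \end{pmatrix}
= \begin{pmatrix} 0 & (p-1)\,z_t^+ + \rho c\,q_t^+ \\ 0 & b \end{pmatrix}\begin{pmatrix} \hat Y_t \\ \hat Z_t \end{pmatrix} dt
+ \begin{pmatrix} 0 \\ a \end{pmatrix} dt
\quad\text{with}\quad
\begin{pmatrix} \hat Y_0 \\ \hat Z_0 \end{pmatrix} = \begin{pmatrix} 0 \\ \sigma_0 \end{pmatrix}.
\]

This already shows that we have the identical (closed form) ODE solution for $\hat Z_t$ as in the uncorrelated case. On the other hand, the form of $\hat Y_T$ now exhibits an additional term, as is seen in

\[
\hat Y_T = (p-1)\int_0^T z_t^+ \hat Z_t\,dt + \rho c\int_0^T q_t^+ \hat Z_t\,dt .
\]

Since $q^+$ is essentially of the same trigonometric form as $z^+$, it is clear that the explicit computations of the uncorrelated case extend. In the end, one finds without too much difficulty

\[
c_2^+ = \Lambda'(1) \times \hat Y_T(h_0^+) = q_0^+\left\{\sigma_0 + a\,\frac{\tan(\chi_p T/2)}{\chi_p}\right\}.
\]

A similar computation along $h_0^-$ gives $c_2^- = \Lambda'(1) \times \hat Y_T(h_0^-)$ in explicit form, and $c_2$ is identified as $\max(c_2^+, c_2^-)$.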
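One way to see where a closed form of this type can come from, without evaluating trigonometric-exponential integrals, is an integration by parts. The following is a sketch rather than the computation of the text, under these assumptions: $\Lambda'(1) = p$; $\hat Y_T = \int_0^T [(p-1) z_t^+ + \rho c\,q_t^+]\hat Z_t\,dt$; $\dot{\hat Z}_t = b\hat Z_t + a$ with $\hat Z_0 = \sigma_0$; $\dot q_t = p(1-p) z_t - \tilde b q_t$ with $\tilde b = b + \rho c p$; and transversality $q_T^+ = 0$, i.e. $\tilde b = \chi_p\cot(\chi_p T)$.

```latex
% Key observation: p(p-1) z + \rho c p\, q = -\dot q - \tilde b q + \rho c p\, q = -\dot q - b q.
\begin{align*}
\Lambda'(1)\,\hat Y_T
 &= \int_0^T \big[p(p-1)\, z_t^+ + \rho c p\, q_t^+\big]\hat Z_t\,dt
  = -\int_0^T \big(\dot q_t^+ + b\, q_t^+\big)\hat Z_t\,dt \\
 &= -\big[q_t^+ \hat Z_t\big]_0^T + \int_0^T q_t^+\big(\dot{\hat Z}_t - b\,\hat Z_t\big)\,dt
  = q_0^+\sigma_0 + a\int_0^T q_t^+\,dt \\
 &= q_0^+\Big\{\sigma_0 + \frac{a}{\chi_p}\Big[\sin(\chi_p T)
      - \frac{\tilde b}{\chi_p}\big(1-\cos(\chi_p T)\big)\Big]\Big\}
  = q_0^+\Big\{\sigma_0 + \frac{a}{\chi_p}\tan\frac{\chi_p T}{2}\Big\}.
\end{align*}
% Last step: \tilde b/\chi_p = \cot(\chi_p T) and
% \sin x - \cot x\,(1-\cos x) = (1-\cos x)/\sin x = \tan(x/2).
```

Setting $\rho = 0$ (so that $\tilde b = b$) recovers the uncorrelated expression for $c_2$.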

4.6.3 Checking non-degeneracy, zero and non-zero correlation 

We now check the non-degeneracy condition (ND), as introduced in the corresponding definition, which of course is the ultimate justification that an expansion of the form (37) with the constants computed above holds true. Again, focus is on the case of correlation parameter $\rho \in (-1,0]$. We saw in the previous sections (for $\rho = 0$, then $\rho < 0$) that $\#\mathcal{K}_{\mathrm a}^{\min} = \#\{h_0^+, h_0^-\} = 2$, whenever $\mathrm a > 0$. (In fact, we apply this with $\mathrm a = 1$.)

Secondly, a look at (39) reveals that the degenerate region is $\{(y,z) : z = 0\}$, the complement of which is elliptic. Clearly, no controlled path which reaches $y_T = \mathrm a > 0$ can stay in the degenerate region for all times $t \in [0,T]$; after all, this would entail $dy \equiv 0$ and hence $y_T = 0$. We conclude that any ODE solution driven by $h \in \mathcal{K}_{\mathrm a}^{\min}$ must intersect the region of ellipticity; but this already implies non-degeneracy of the corresponding (deterministic) Malliavin covariance matrix.

At last, we check non-focality and focus on $h_0^+$, the other case being similar. We have to check non-degeneracy of the Jacobian of the map $\pi H_{0\leftarrow T}(\mathrm a, \bullet\,; *, 0)$, evaluated at $\bullet = z_T$, $* = p_T$ after differentiation, where $z_T, p_T$ are obtained from the Hamiltonian flow at time $T$, cf. lemma 33, with time-0 initial data $(0, 0; p_1^+, q_{0,1}^+)$.
With some abuse of notation, write

\[
(y_0, z_0) = \pi H_{0\leftarrow T}(\mathrm a, z; p, 0).
\]

Our non-degeneracy condition requires us to show that

\[
\det\begin{pmatrix} \dfrac{\partial y_0}{\partial p} & \dfrac{\partial y_0}{\partial z} \\[3mm] \dfrac{\partial z_0}{\partial p} & \dfrac{\partial z_0}{\partial z} \end{pmatrix}\Bigg|_* \ne 0 \tag{61}
\]

where $(\dots)|_*$ indicates evaluation $(\dots)|_{(p,z) = (p_1^+, z_T)}$ in the sequel. This implies in particular that all expressions which are formulated in terms of the solutions to the Hamiltonian flows reduce to the corresponding expressions identified in proposition 31 for $\rho = 0$, resp. in section 4.6.2 for $\rho < 0$. For instance, $(y_0, z_0)|_* = (0,0)$, $y_T|_* = \mathrm a$, $z|_* = z_T \ne 0$, $\chi_p T|_* \in [\pi/2, \pi)$ and so on.

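Since $(z_\cdot, q_\cdot)$ solves a linear ODE, we can compute

\[
z_0 = \frac{z}{\chi_p}\Big(\chi_p\cos(\chi_p T) - \tilde b_p\sin(\chi_p T)\Big).
\]

We first note that $\partial z_0/\partial z|_*$ is zero; indeed, this follows from (55). Our next claim is $\partial y_0/\partial z|_* \ne 0$. Indeed, from the structure of the Hamilton ODEs,

\[
y_0 - \mathrm a = z^2\,(\cdots)
\]

where $(\cdots)$ does not depend on $z$. As a result $\partial y_0/\partial z|_* = 2z\,(\cdots)|_* = 2\,\frac{y_0 - \mathrm a}{z}\big|_* = -2\mathrm a/z_T \ne 0$.

It remains to check that $\partial z_0/\partial p|_* \ne 0$. To this end, recall, as a consequence of the transversality condition, see (55), that $\chi_p\cos(\chi_p T) - \tilde b_p\sin(\chi_p T)|_* = 0$. It follows that

\[
\frac{\partial z_0}{\partial p}\Big|_* = \frac{z}{\chi_p}\,\partial_p\Big(\chi_p\cos(\chi_p T) - \tilde b_p\sin(\chi_p T)\Big)\Big|_*
\]

and since $z/\chi_p|_* \ne 0$, it will be enough to show (strict) negativity of $\partial_p(\dots)$ above. By scaling, there is no loss of generality in taking $T = 1$ and we shall do so from here on. Then, writing $\chi_p' = \partial_p\chi_p$,

\[
\partial_p(\dots) = \chi_p'\Big[(1 - \tilde b_p)\cos(\chi_p) - \chi_p\sin(\chi_p)\Big] - \rho c\sin(\chi_p).
\]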

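Since $\tilde b_p|_* \le 0$ and $\chi_p|_* \in [\pi/2, \pi)$ we see that $[\dots]|_* < 0$. Given that $\chi_p'|_* > 0$, this already settles the negativity claim in the zero-correlation case. In the case $-1 < \rho < 0$, we use (55) to write

\[
\frac{\chi_p}{z}\frac{\partial z_0}{\partial p}\Big|_* = \chi_p'\Big[(1 - \tilde b_p)\,\frac{\tilde b_p\sin(\chi_p)}{\chi_p} - \chi_p\sin(\chi_p)\Big] - \rho c\sin(\chi_p)\,\Big|_* .
\]

After division by $\sin(\chi_p)/\chi_p|_* > 0$, we have, using $\tilde b_p = b + \rho c p \le 0$, $b \le 0$ and again $\chi_p'|_* > 0$,

\[
\begin{aligned}
\chi_p'\big[(1 - \tilde b_p)\tilde b_p - \chi_p^2\big] - \rho c\,\chi_p\,\big|_*
&\le \chi_p'\big[(1 - \rho c p)\,\rho c p - \chi_p^2\big] - \rho c\,\chi_p\,\big|_* \\
&\le -\rho c\,\big(\chi_p - p\,\chi_p'\big)\big|_* .
\end{aligned}
\]

With $-\rho c > 0$, it will then be sufficient to show strict negativity of $\chi_p - p\,\chi_p'|_*$. To this end note that the definition, $\chi_p^2 = c^2 p(p-1) - \tilde b_p^2$, implies

\[
\begin{aligned}
2\chi_p\chi_p' &= c^2(2p-1) - 2\tilde b_p(\rho c),\\
\chi_p\,p\,\chi_p' &= c^2 p\,(p - \tfrac12) - \tilde b_p(\rho c p)
= \chi_p^2 + \frac{c^2 p}{2} + \tilde b_p b > \chi_p^2
\end{aligned}
\]

whenever $c^2 p/2 + \tilde b_p b > 0$, which is surely the case upon evaluation $\dots|_*$.

We conclude that $\partial z_0/\partial p|_* \ne 0$, and then validity of (61), for any parameter set $\rho \in (-1, 0]$, $b \le 0$, $c > 0$, $T > 0$. In other words, we have completed the check of our non-degeneracy condition.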

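4.7 Comments on Heston [32] and Lions–Musiela [38]

We recall from [27], [24] that the density of log-stock price $Y_T$ in the Heston model,

\[
\begin{aligned}
dY &= -\tfrac{V}{2}\,dt + \sqrt{V}\,dW^1, & Y(0) &= y_0 = 0,\\
dV &= (a + bV)\,dt + c\sqrt{V}\,dW^2, & V(0) &= v_0 > 0,
\end{aligned}
\]

with $a \ge 0$, $b \le 0$, $c > 0$ and correlation $\rho \in (-1,0]$ has the form

\[
f(y) = e^{-c_1 y}\,e^{c_2\sqrt{y}}\,y^{-3/4 + a/c^2}\,\big(c_3 + O(y^{-1/2})\big) \quad\text{as } y \to \infty,
\]

with explicitly computable $c_1 = C_1(b,c,\rho,T)$ and $c_2 = \sqrt{v_0} \times C_2(b,c,\rho,T)$, neither of which depends on $a$. Scaling with $\theta = 2$,

\[
Y^\varepsilon := \varepsilon^2 Y, \qquad V^\varepsilon := \varepsilon^2 V,
\]

indeed yields a small noise problem, namely

\[
\begin{aligned}
dY^\varepsilon &= -\tfrac{V^\varepsilon}{2}\,dt + \sqrt{V^\varepsilon}\,\varepsilon\,dW^1, & Y^\varepsilon(0) &= 0,\\
dV^\varepsilon &= (a\varepsilon^2 + bV^\varepsilon)\,dt + c\sqrt{V^\varepsilon}\,\varepsilon\,dW^2, & V^\varepsilon(0) &= v_0\varepsilon^2 > 0.
\end{aligned}
\]

The algebraic factor $y^{-3/4 + a/c^2}$ in the above expansion then contradicts the expected factor; cf.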

121 

There is no contradiction here, of course. Rather, we see an explicit example where "formal" application of a theorem to a model which is short of the required regularity leads to a wrong conclusion (at least at the fine level of algebraic factors). Remark that one can trace the origin of this unexpected $y^{-3/4 + a/c^2}$ factor to the behaviour of the one-dimensional variance process $V$, also known as Feller or Cox-Ingersoll-Ross diffusion. Curiously then, even a large deviation principle for $V^\varepsilon$ as given above presently lacks justification, despite the recent advances in [17], [6]. Clearly then, we are not anywhere near obtaining the Heston tail result of [27], [24] with the present methods.

However, in the special case when $a = c^2/4$ it is an easy exercise to see that the Heston model can be realized as a Stein-Stein model (take $V = Z^2$, where $Z$ is the volatility component of the Stein-Stein model); the resulting expressions are then seen to be consistent with those obtained in [24] and, in particular, $y^{-3/4 + a/c^2} = y^{-1/2}$.
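The "easy exercise" can be sketched with Itô's formula. The computation below is an illustration under assumptions: the Stein-Stein volatility is taken with zero mean-reversion level, $dZ = \beta Z\,dt + \gamma\,dW^2$ (the symbols $\beta, \gamma$ are introduced here only to avoid a clash with the Heston parameters $a, b, c$), and the sign of $Z$ is absorbed into a new Brownian motion $d\tilde W = \operatorname{sgn}(Z)\,dW^2$ via Lévy's characterization (and similarly for $W^1$ in the equation for $Y$).

```latex
\begin{align*}
V &:= Z^2, \qquad dZ = \beta Z\,dt + \gamma\,dW^2, \\
dV &= 2Z\,dZ + d\langle Z\rangle
    = (\gamma^2 + 2\beta V)\,dt + 2\gamma Z\,dW^2
    = (\gamma^2 + 2\beta V)\,dt + 2\gamma\sqrt{V}\,d\tilde W, \\
&\text{so that } a = \gamma^2,\quad b = 2\beta,\quad c = 2\gamma
 \quad\Longrightarrow\quad a = \frac{c^2}{4}, \qquad
 y^{-3/4 + a/c^2} = y^{-1/2}.
\end{align*}
```

Under these assumptions the Heston drift level is tied to the vol-of-vol by $a = c^2/4$, and the algebraic factor collapses to $y^{-1/2}$, the power that a small noise expansion with prefactor $\varepsilon^{-1}$ and $\varepsilon^2 = 1/y$ would produce.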

Another class of non-smooth, non-affine stochastic volatility models with "$\theta = 2$"-scaling was introduced by Lions–Musiela [38]. For $\delta \in [1/2, 1]$ and $\gamma = 1 - \delta$ they consider the 2-dimensional diffusion

\[
\begin{aligned}
dY &= -\tfrac12 Z^{2\delta}\,dt + Z^\delta\,dW^1, & Y_0 &= 0,\\
dZ &= bZ\,dt + cZ^\gamma\,dW^2, & Z(0) &= z_0 > 0.
\end{aligned}
\]

And indeed, with $Y^\varepsilon = \varepsilon^2 Y$ and $Z^\varepsilon = \varepsilon^{1/\delta} Z$ this becomes a small noise problem:

\[
\begin{aligned}
dY^\varepsilon &= -\tfrac12 (Z^\varepsilon)^{2\delta}\,dt + (Z^\varepsilon)^\delta\,\varepsilon\,dW^1, & Y^\varepsilon(0) &= 0,\\
dZ^\varepsilon &= bZ^\varepsilon\,dt + c\,(Z^\varepsilon)^\gamma\,\varepsilon\,dW^2, & Z^\varepsilon(0) &= \varepsilon^{1/\delta} z_0.
\end{aligned}
\]

In their paper they establish exponential moments of $Y_T$ in the precise sense (32), (33). It is tempting to use the $\theta$-scaling corollary, at least to leading large deviation order, to obtain the exponential tail of Z for models that scale with $\theta = 2$. Of course, as was discussed in the Heston case, such a "formal" application can be wrong. Further work, building on [17], [6], will be necessary to deal with such degenerate models directly.